Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 45346 |
| Missing cells | 239390 |
| Missing cells (%) | 19.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.3 MiB |
| Average record size in memory | 216.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 17 |
title has a high cardinality: 42196 distinct values | High cardinality |
overview has a high cardinality: 44231 distinct values | High cardinality |
original_language has a high cardinality: 89 distinct values | High cardinality |
tagline has a high cardinality: 20268 distinct values | High cardinality |
name_btc has a high cardinality: 1078 distinct values | High cardinality |
poster_btc has a high cardinality: 1078 distinct values | High cardinality |
backdrop_btc has a high cardinality: 1077 distinct values | High cardinality |
iso_639_1 has a high cardinality: 1916 distinct values | High cardinality |
language_name has a high cardinality: 1827 distinct values | High cardinality |
companies_id has a high cardinality: 22290 distinct values | High cardinality |
companies_name has a high cardinality: 22240 distinct values | High cardinality |
countries_iso has a high cardinality: 2383 distinct values | High cardinality |
countries_name has a high cardinality: 2383 distinct values | High cardinality |
release_date has a high cardinality: 17333 distinct values | High cardinality |
popularity is highly overall correlated with vote_count | High correlation |
vote_count is highly overall correlated with popularity and 1 other fields | High correlation |
budget is highly overall correlated with revenue and 1 other fields | High correlation |
revenue is highly overall correlated with vote_count and 2 other fields | High correlation |
return is highly overall correlated with budget and 1 other fields | High correlation |
status is highly imbalanced (97.0%) | Imbalance |
original_language is highly imbalanced (67.4%) | Imbalance |
iso_639_1 is highly imbalanced (62.0%) | Imbalance |
language_name is highly imbalanced (62.0%) | Imbalance |
countries_iso is highly imbalanced (57.7%) | Imbalance |
countries_name is highly imbalanced (57.7%) | Imbalance |
overview has 946 (2.1%) missing values | Missing |
tagline has 24960 (55.0%) missing values | Missing |
id_btc has 42183 (93.0%) missing values | Missing |
name_btc has 42183 (93.0%) missing values | Missing |
poster_btc has 42183 (93.0%) missing values | Missing |
backdrop_btc has 42183 (93.0%) missing values | Missing |
iso_639_1 has 3792 (8.4%) missing values | Missing |
language_name has 3915 (8.6%) missing values | Missing |
companies_id has 12264 (27.0%) missing values | Missing |
companies_name has 12264 (27.0%) missing values | Missing |
countries_iso has 6213 (13.7%) missing values | Missing |
countries_name has 6213 (13.7%) missing values | Missing |
popularity is highly skewed (γ1 = 29.21542294) | Skewed |
return is highly skewed (γ1 = 138.283787) | Skewed |
title is uniformly distributed | Uniform |
overview is uniformly distributed | Uniform |
tagline is uniformly distributed | Uniform |
id has unique values | Unique |
vote_average has 2944 (6.5%) zeros | Zeros |
vote_count has 2846 (6.3%) zeros | Zeros |
runtime has 1781 (3.9%) zeros | Zeros |
budget has 36470 (80.4%) zeros | Zeros |
revenue has 37949 (83.7%) zeros | Zeros |
return has 40033 (88.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-07-01 12:46:37.275489 |
|---|---|
| Analysis finished | 2023-07-01 12:47:50.457230 |
| Duration | 1 minute and 13.18 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
id
Real number (ℝ)
| Distinct | 45346 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108042.22 |
| Minimum | 2 |
|---|---|
| Maximum | 469172 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 5340.25 |
| Q1 | 26390.25 |
| median | 59852.5 |
| Q3 | 156601.5 |
| 95-th percentile | 357370.75 |
| Maximum | 469172 |
| Range | 469170 |
| Interquartile range (IQR) | 130211.25 |
Descriptive statistics
| Standard deviation | 112187.33 |
|---|---|
| Coefficient of variation (CV) | 1.0383656 |
| Kurtosis | 0.55836782 |
| Mean | 108042.22 |
| Median Absolute Deviation (MAD) | 44405 |
| Skewness | 1.2828454 |
| Sum | 4.8992825 × 109 |
| Variance | 1.2585996 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 862 | 1 | < 0.1% |
| 202198 | 1 | < 0.1% |
| 124026 | 1 | < 0.1% |
| 300168 | 1 | < 0.1% |
| 132316 | 1 | < 0.1% |
| 74458 | 1 | < 0.1% |
| 40777 | 1 | < 0.1% |
| 188222 | 1 | < 0.1% |
| 328483 | 1 | < 0.1% |
| 107637 | 1 | < 0.1% |
| Other values (45336) | 45336 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 | |
| 15 | 1 | |
| 16 | 1 |
| Value | Count | Frequency (%) |
| 469172 | 1 | |
| 468707 | 1 | |
| 468343 | 1 | |
| 467731 | 1 | |
| 465044 | 1 | |
| 464819 | 1 | |
| 464207 | 1 | |
| 464111 | 1 | |
| 463906 | 1 | |
| 463800 | 1 |
title
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 42196 |
|---|---|
| Distinct (%) | 93.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.4 KiB |
| Cinderella | 11 |
|---|---|
| Hamlet | 9 |
| Alice in Wonderland | 9 |
| Beauty and the Beast | 8 |
| Les Misérables | 8 |
| Other values (42191) |
Length
| Max length | 105 |
|---|---|
| Median length | 79 |
| Mean length | 16.702289 |
| Min length | 1 |
Characters and Unicode
| Total characters | 757382 |
|---|---|
| Distinct characters | 287 |
| Distinct categories | 17 ? |
| Distinct scripts | 7 ? |
| Distinct blocks | 12 ? |
Unique
| Unique | 39892 ? |
|---|---|
| Unique (%) | 88.0% |
Sample
| 1st row | Toy Story |
|---|---|
| 2nd row | Jumanji |
| 3rd row | Grumpier Old Men |
| 4th row | Waiting to Exhale |
| 5th row | Father of the Bride Part II |
Common Values
| Value | Count | Frequency (%) |
| Cinderella | 11 | < 0.1% |
| Hamlet | 9 | < 0.1% |
| Alice in Wonderland | 9 | < 0.1% |
| Beauty and the Beast | 8 | < 0.1% |
| Les Misérables | 8 | < 0.1% |
| Treasure Island | 7 | < 0.1% |
| The Three Musketeers | 7 | < 0.1% |
| A Christmas Carol | 7 | < 0.1% |
| Bluebeard | 6 | < 0.1% |
| The Hound of the Baskervilles | 6 | < 0.1% |
| Other values (42186) | 45268 |
Length
| Value | Count | Frequency (%) |
| the | 14544 | 10.7% |
| of | 4923 | 3.6% |
| a | 2238 | 1.6% |
| in | 1693 | 1.2% |
| and | 1629 | 1.2% |
| to | 1053 | 0.8% |
| 756 | 0.6% | |
| man | 665 | 0.5% |
| love | 664 | 0.5% |
| for | 601 | 0.4% |
| Other values (24353) | 107329 |
Most occurring characters
| Value | Count | Frequency (%) |
| 90771 | 12.0% | |
| e | 76195 | 10.1% |
| a | 48911 | 6.5% |
| o | 45636 | 6.0% |
| n | 40797 | 5.4% |
| r | 39993 | 5.3% |
| i | 39748 | 5.2% |
| t | 36706 | 4.8% |
| s | 29500 | 3.9% |
| h | 28499 | 3.8% |
| Other values (277) | 280626 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 533789 | |
| Uppercase Letter | 117198 | 15.5% |
| Space Separator | 90771 | 12.0% |
| Other Punctuation | 10485 | 1.4% |
| Decimal Number | 3845 | 0.5% |
| Dash Punctuation | 980 | 0.1% |
| Close Punctuation | 87 | < 0.1% |
| Open Punctuation | 85 | < 0.1% |
| Final Punctuation | 38 | < 0.1% |
| Other Letter | 25 | < 0.1% |
| Other values (7) | 79 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 76195 | |
| a | 48911 | |
| o | 45636 | 8.5% |
| n | 40797 | 7.6% |
| r | 39993 | 7.5% |
| i | 39748 | 7.4% |
| t | 36706 | 6.9% |
| s | 29500 | 5.5% |
| h | 28499 | 5.3% |
| l | 25904 | 4.9% |
| Other values (121) | 121900 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 16010 | |
| S | 10332 | 8.8% |
| M | 8029 | 6.9% |
| B | 7653 | 6.5% |
| C | 7157 | 6.1% |
| A | 6782 | 5.8% |
| D | 6330 | 5.4% |
| L | 5869 | 5.0% |
| H | 5170 | 4.4% |
| W | 5162 | 4.4% |
| Other values (65) | 38704 |
Other Letter
| Value | Count | Frequency (%) |
| چ | 2 | 8.0% |
| ه | 2 | 8.0% |
| ک | 2 | 8.0% |
| ی | 2 | 8.0% |
| 傳 | 1 | 4.0% |
| 空 | 1 | 4.0% |
| 時 | 1 | 4.0% |
| 狗 | 1 | 4.0% |
| 貓 | 1 | 4.0% |
| ª | 1 | 4.0% |
| Other values (11) | 11 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3714 | |
| ' | 2505 | |
| . | 1603 | |
| , | 1133 | 10.8% |
| ! | 647 | 6.2% |
| & | 458 | 4.4% |
| ? | 269 | 2.6% |
| / | 79 | 0.8% |
| * | 19 | 0.2% |
| # | 13 | 0.1% |
| Other values (8) | 45 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 861 | |
| 1 | 695 | |
| 0 | 616 | |
| 3 | 482 | |
| 9 | 229 | 6.0% |
| 4 | 228 | 5.9% |
| 5 | 224 | 5.8% |
| 7 | 193 | 5.0% |
| 8 | 161 | 4.2% |
| 6 | 156 | 4.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 17 | |
| × | 3 | 12.5% |
| ∞ | 1 | 4.2% |
| = | 1 | 4.2% |
| → | 1 | 4.2% |
| − | 1 | 4.2% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 12 | |
| ² | 3 | 15.8% |
| ³ | 2 | 10.5% |
| ⅓ | 1 | 5.3% |
| ⁴ | 1 | 5.3% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 | |
| ☆ | 2 | |
| ™ | 1 | 12.5% |
| ♡ | 1 | 12.5% |
| № | 1 | 12.5% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 18 | |
| ¢ | 2 | 9.5% |
| £ | 1 | 4.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 965 | |
| – | 15 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 82 | |
| ] | 5 | 5.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 80 | |
| [ | 5 | 5.9% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 37 | |
| ” | 1 | 2.6% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 | |
| “ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 90771 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Format
| Value | Count | Frequency (%) |
| | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 650472 | |
| Common | 106370 | 14.0% |
| Cyrillic | 346 | < 0.1% |
| Greek | 170 | < 0.1% |
| Arabic | 11 | < 0.1% |
| Katakana | 8 | < 0.1% |
| Han | 5 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 76195 | 11.7% |
| a | 48911 | 7.5% |
| o | 45636 | 7.0% |
| n | 40797 | 6.3% |
| r | 39993 | 6.1% |
| i | 39748 | 6.1% |
| t | 36706 | 5.6% |
| s | 29500 | 4.5% |
| h | 28499 | 4.4% |
| l | 25904 | 4.0% |
| Other values (107) | 238583 |
Common
| Value | Count | Frequency (%) |
| 90771 | ||
| : | 3714 | 3.5% |
| ' | 2505 | 2.4% |
| . | 1603 | 1.5% |
| , | 1133 | 1.1% |
| - | 965 | 0.9% |
| 2 | 861 | 0.8% |
| 1 | 695 | 0.7% |
| ! | 647 | 0.6% |
| 0 | 616 | 0.6% |
| Other values (50) | 2860 | 2.7% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 32 | 9.2% |
| е | 32 | 9.2% |
| а | 29 | 8.4% |
| н | 24 | 6.9% |
| и | 23 | 6.6% |
| р | 22 | 6.4% |
| к | 17 | 4.9% |
| с | 15 | 4.3% |
| в | 14 | 4.0% |
| т | 14 | 4.0% |
| Other values (38) | 124 |
Greek
| Value | Count | Frequency (%) |
| α | 20 | 11.8% |
| ο | 14 | 8.2% |
| ι | 14 | 8.2% |
| τ | 9 | 5.3% |
| λ | 8 | 4.7% |
| ά | 8 | 4.7% |
| ρ | 8 | 4.7% |
| ν | 7 | 4.1% |
| π | 6 | 3.5% |
| ς | 6 | 3.5% |
| Other values (32) | 70 |
Katakana
| Value | Count | Frequency (%) |
| テ | 1 | |
| ポ | 1 | |
| ィ | 1 | |
| ス | 1 | |
| タ | 1 | |
| ン | 1 | |
| ァ | 1 | |
| フ | 1 |
Arabic
| Value | Count | Frequency (%) |
| چ | 2 | |
| ه | 2 | |
| ک | 2 | |
| ی | 2 | |
| س | 1 | |
| ا | 1 | |
| ج | 1 |
Han
| Value | Count | Frequency (%) |
| 傳 | 1 | |
| 空 | 1 | |
| 時 | 1 | |
| 狗 | 1 | |
| 貓 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 755820 | |
| None | 1121 | 0.1% |
| Cyrillic | 346 | < 0.1% |
| Punctuation | 62 | < 0.1% |
| Arabic | 11 | < 0.1% |
| Katakana | 8 | < 0.1% |
| CJK | 5 | < 0.1% |
| Misc Symbols | 3 | < 0.1% |
| Letterlike Symbols | 2 | < 0.1% |
| Math Operators | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 90771 | 12.0% | |
| e | 76195 | 10.1% |
| a | 48911 | 6.5% |
| o | 45636 | 6.0% |
| n | 40797 | 5.4% |
| r | 39993 | 5.3% |
| i | 39748 | 5.3% |
| t | 36706 | 4.9% |
| s | 29500 | 3.9% |
| h | 28499 | 3.8% |
| Other values (76) | 279064 |
None
| Value | Count | Frequency (%) |
| é | 216 | |
| ä | 127 | 11.3% |
| ö | 55 | 4.9% |
| è | 53 | 4.7% |
| ô | 44 | 3.9% |
| ü | 39 | 3.5% |
| ó | 37 | 3.3% |
| ı | 35 | 3.1% |
| á | 35 | 3.1% |
| í | 33 | 2.9% |
| Other values (108) | 447 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 37 | |
| – | 15 | |
| … | 5 | 8.1% |
| | 2 | 3.2% |
| ‘ | 1 | 1.6% |
| ” | 1 | 1.6% |
| “ | 1 | 1.6% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 32 | 9.2% |
| е | 32 | 9.2% |
| а | 29 | 8.4% |
| н | 24 | 6.9% |
| и | 23 | 6.6% |
| р | 22 | 6.4% |
| к | 17 | 4.9% |
| с | 15 | 4.3% |
| в | 14 | 4.0% |
| т | 14 | 4.0% |
| Other values (38) | 124 |
Arabic
| Value | Count | Frequency (%) |
| چ | 2 | |
| ه | 2 | |
| ک | 2 | |
| ی | 2 | |
| س | 1 | |
| ا | 1 | |
| ج | 1 |
Misc Symbols
| Value | Count | Frequency (%) |
| ☆ | 2 | |
| ♡ | 1 |
CJK
| Value | Count | Frequency (%) |
| 傳 | 1 | |
| 空 | 1 | |
| 時 | 1 | |
| 狗 | 1 | |
| 貓 | 1 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 1 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 1 | |
| № | 1 |
Math Operators
| Value | Count | Frequency (%) |
| ∞ | 1 | |
| − | 1 |
Katakana
| Value | Count | Frequency (%) |
| テ | 1 | |
| ポ | 1 | |
| ィ | 1 | |
| ス | 1 | |
| タ | 1 | |
| ン | 1 | |
| ァ | 1 | |
| フ | 1 |
Arrows
| Value | Count | Frequency (%) |
| → | 1 |
overview
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 44231 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 946 |
| Missing (%) | 2.1% |
| Memory size | 354.4 KiB |
| Nooverviewfound. | 133 |
|---|---|
| NoOverview | 7 |
| Nomovieoverviewavailable. | 3 |
| AdaptationoftheJaneAustennovel. | 3 |
| Afewfunnylittlenovelsaboutdifferentaspectsoflife. | 3 |
| Other values (44226) |
Length
| Max length | 851 |
|---|---|
| Median length | 666 |
| Mean length | 269.13806 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11949730 |
|---|---|
| Distinct characters | 423 |
| Distinct categories | 22 ? |
| Distinct scripts | 13 ? |
| Distinct blocks | 21 ? |
Unique
| Unique | 44201 ? |
|---|---|
| Unique (%) | 99.6% |
Sample
| 1st row | LedbyWoody,Andy'stoyslivehappilyinhisroomuntilAndy'sbirthdaybringsBuzzLightyearontothescene.AfraidoflosinghisplaceinAndy'sheart,WoodyplotsagainstBuzz.ButwhencircumstancesseparateBuzzandWoodyfromtheirowner,theduoeventuallylearnstoputasidetheirdifferences. |
|---|---|
| 2nd row | WhensiblingsJudyandPeterdiscoveranenchantedboardgamethatopensthedoortoamagicalworld,theyunwittinglyinviteAlan--anadultwho'sbeentrappedinsidethegamefor26years--intotheirlivingroom.Alan'sonlyhopeforfreedomistofinishthegame,whichprovesriskyasallthreefindthemselvesrunningfromgiantrhinoceroses,evilmonkeysandotherterrifyingcreatures. |
| 3rd row | Afamilyweddingreignitestheancientfeudbetweennext-doorneighborsandfishingbuddiesJohnandMax.Meanwhile,asultryItaliandivorcéeopensarestaurantatthelocalbaitshop,alarmingthelocalswhoworryshe'llscarethefishaway.Butshe'slessinterestedinseafoodthansheisincookingupahottimewithMax. |
| 4th row | Cheatedon,mistreatedandsteppedon,thewomenareholdingtheirbreath,waitingfortheelusive"goodman"tobreakastringofless-than-stellarlovers.FriendsandconfidantsVannah,Bernie,GloandRobintalkitallout,determinedtofindabetterwaytobreathe. |
| 5th row | JustwhenGeorgeBankshasrecoveredfromhisdaughter'swedding,hereceivesthenewsthatshe'spregnant...andthatGeorge'swife,Nina,isexpectingtoo.Hewasplanningonsellingtheirhome,butthat'saplanthat--likeGeorge--willhavetochangewiththearrivalofbothagrandchildandakidofhisown. |
Common Values
| Value | Count | Frequency (%) |
| Nooverviewfound. | 133 | 0.3% |
| NoOverview | 7 | < 0.1% |
| Nomovieoverviewavailable. | 3 | < 0.1% |
| AdaptationoftheJaneAustennovel. | 3 | < 0.1% |
| Afewfunnylittlenovelsaboutdifferentaspectsoflife. | 3 | < 0.1% |
| Whenfourwomenmoveintoanoldhouseleftbyonewoman'saunt,strangethingsbegintohappen.Bizarrevoices,visionsofghosts,andmysteriousnoisesleadthemtodiscoverthedarkestpowersofevilandahorrorandagonybeyondterror. | 2 | < 0.1% |
| AdventurerAllanQuartermainleadsanexpeditionintounchartedAfricanterritoryinanattempttolocateanexplorerwhowentmissingduringhissearchforthefableddiamondminesofKingSolomon. | 2 | < 0.1% |
| DirectorMichaelAptedrevisitsthesamegroupofBritish-bornadultsaftera7yearwait.Thesubjectsareinterviewedastothechangesthathaveoccurredintheirlivesduringthelastsevenyears. | 2 | < 0.1% |
| Wilburthepigisscaredoftheendoftheseason,becauseheknowsthatcomethattime,hewillenduponthedinnertable.HehatchesaplanwithCharlotte,aspiderthatlivesinhispen,toensurethatthiswillneverhappen. | 2 | < 0.1% |
| AwoodenboyBuratinotriestofindhisplaceinlife.HebefriendstoysfromatoytheaterownedbyevilKarabas-Barabas,getstrickedbyAlicetheFoxandBasiliotheCatandfinallydiscoversthemysteryofagoldenkeygiventohimbykindTortilatheTortoise. | 2 | < 0.1% |
| Other values (44221) | 44241 | |
| (Missing) | 946 | 2.1% |
Length
| Value | Count | Frequency (%) |
| nooverviewfound | 134 | 0.3% |
| nooverview | 9 | < 0.1% |
| nooverviewyet | 3 | < 0.1% |
| nomovieoverviewavailable | 3 | < 0.1% |
| adaptationofthejaneaustennovel | 3 | < 0.1% |
| afewfunnylittlenovelsaboutdifferentaspectsoflife | 3 | < 0.1% |
| funny,entertainingcomedywithafewstorylines.allofthemhaveonethingincommon-aresorttownofriminiinitaly | 2 | < 0.1% |
| mary,awriterworkingonanovelaboutalovetriangle,isattractedtoherpublisher.hersuitorjimmyisdeterminedtobreakthemup;heintroducesmarytothepublisher'swifewithouttellingmarywhosheis | 2 | < 0.1% |
| nickcarraway,ayoungmidwesternernowlivingonlongisland,findshimselffascinatedbythemysteriouspastandlavishlifestyleofhisneighbor,thenouveaurichejaygatsby.heisdrawnintogatsby'scircle,becomingawitnesstoobsessionandtragedy | 2 | < 0.1% |
| poorbuthappy,youngnelloandhisgrandfatherlivealone,deliveringmilkasalivelihood,intheoutskirtsofantwerp,acityinflanders(theflemishordutch-speakingpartofmodern-daybelgium).theydiscoverabeatendog(abouvier,alargesturdydognativetoflanders)andadoptitandnurseitbacktohealth,namingitpatrasche,themiddlenameofnello'smothermary,whodiedwhennellowasveryyoung.nello'smotherwasatalentedartist,andlikehismother,hedelightsindrawing,andhisfriendaloiseishismodelandgreatestfanandsupporter | 2 | < 0.1% |
| Other values (44215) | 44237 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1362739 | 11.4% |
| a | 939721 | 7.9% |
| t | 934056 | 7.8% |
| i | 850842 | 7.1% |
| o | 829251 | 6.9% |
| n | 821950 | 6.9% |
| s | 767224 | 6.4% |
| r | 743645 | 6.2% |
| h | 600339 | 5.0% |
| l | 478418 | 4.0% |
| Other values (413) | 3621545 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11141275 | |
| Uppercase Letter | 390651 | 3.3% |
| Other Punctuation | 312523 | 2.6% |
| Decimal Number | 42192 | 0.4% |
| Dash Punctuation | 36745 | 0.3% |
| Close Punctuation | 10094 | 0.1% |
| Open Punctuation | 10071 | 0.1% |
| Final Punctuation | 4549 | < 0.1% |
| Initial Punctuation | 880 | < 0.1% |
| Currency Symbol | 329 | < 0.1% |
| Other values (12) | 421 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1362739 | |
| a | 939721 | 8.4% |
| t | 934056 | 8.4% |
| i | 850842 | 7.6% |
| o | 829251 | 7.4% |
| n | 821950 | 7.4% |
| s | 767224 | 6.9% |
| r | 743645 | 6.7% |
| h | 600339 | 5.4% |
| l | 478418 | 4.3% |
| Other values (142) | 2813090 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 42722 | 10.9% |
| T | 35940 | 9.2% |
| S | 31102 | 8.0% |
| M | 23942 | 6.1% |
| B | 23679 | 6.1% |
| C | 22771 | 5.8% |
| H | 19415 | 5.0% |
| W | 18633 | 4.8% |
| I | 16782 | 4.3% |
| D | 16306 | 4.2% |
| Other values (77) | 139359 |
Other Letter
| Value | Count | Frequency (%) |
| न | 6 | 4.8% |
| र | 6 | 4.8% |
| म | 5 | 4.0% |
| の | 4 | 3.2% |
| प | 3 | 2.4% |
| ద | 3 | 2.4% |
| द | 3 | 2.4% |
| अ | 3 | 2.4% |
| ர | 2 | 1.6% |
| व | 2 | 1.6% |
| Other values (76) | 88 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 133326 | |
| . | 124703 | |
| ' | 31039 | 9.9% |
| " | 11660 | 3.7% |
| : | 3294 | 1.1% |
| ? | 2759 | 0.9% |
| ; | 2492 | 0.8% |
| ! | 1540 | 0.5% |
| / | 765 | 0.2% |
| & | 452 | 0.1% |
| Other values (12) | 493 | 0.2% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ి | 4 | |
| ் | 3 | |
| ్ | 3 | |
| ् | 3 | |
| ̈ | 3 | |
| ా | 2 | 6.1% |
| े | 2 | 6.1% |
| ं | 2 | 6.1% |
| ु | 2 | 6.1% |
| Other values (4) | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9738 | |
| 0 | 8262 | |
| 9 | 6399 | |
| 2 | 4249 | |
| 5 | 2439 | 5.8% |
| 8 | 2378 | 5.6% |
| 3 | 2338 | 5.5% |
| 4 | 2173 | 5.2% |
| 7 | 2131 | 5.1% |
| 6 | 2085 | 4.9% |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 11 | |
| ी | 4 | 14.8% |
| ు | 3 | 11.1% |
| ो | 3 | 11.1% |
| ு | 2 | 7.4% |
| ि | 2 | 7.4% |
| ం | 1 | 3.7% |
| ி | 1 | 3.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 35222 | |
| – | 881 | 2.4% |
| — | 633 | 1.7% |
| ― | 5 | < 0.1% |
| ‐ | 4 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 45 | |
| ™ | 14 | 21.9% |
| ° | 2 | 3.1% |
| ¦ | 2 | 3.1% |
| � | 1 | 1.6% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 20 | |
| + | 11 | |
| = | 6 | 15.0% |
| | | 2 | 5.0% |
| − | 1 | 2.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10018 | |
| [ | 50 | 0.5% |
| { | 2 | < 0.1% |
| „ | 1 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 317 | |
| £ | 10 | 3.0% |
| ₹ | 1 | 0.3% |
| € | 1 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10042 | |
| ] | 50 | 0.5% |
| } | 2 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3842 | |
| ” | 688 | 15.1% |
| » | 19 | 0.4% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 670 | |
| ‘ | 192 | 21.8% |
| « | 18 | 2.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 25 | |
| ` | 12 | |
| ¯ | 1 | 2.6% |
Format
| Value | Count | Frequency (%) |
| | 31 | |
| | 20 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 8 | |
| ¹ | 8 |
Control
| Value | Count | Frequency (%) |
| | 3 | |
| | 1 | 25.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19 |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 2 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11526694 | |
| Common | 417617 | 3.5% |
| Cyrillic | 4587 | < 0.1% |
| Greek | 648 | < 0.1% |
| Devanagari | 77 | < 0.1% |
| Telugu | 30 | < 0.1% |
| Hiragana | 20 | < 0.1% |
| Tamil | 19 | < 0.1% |
| Han | 10 | < 0.1% |
| Hangul | 9 | < 0.1% |
| Other values (3) | 19 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1362739 | |
| a | 939721 | 8.2% |
| t | 934056 | 8.1% |
| i | 850842 | 7.4% |
| o | 829251 | 7.2% |
| n | 821950 | 7.1% |
| s | 767224 | 6.7% |
| r | 743645 | 6.5% |
| h | 600339 | 5.2% |
| l | 478418 | 4.2% |
| Other values (132) | 3198509 |
Common
| Value | Count | Frequency (%) |
| , | 133326 | |
| . | 124703 | |
| - | 35222 | 8.4% |
| ' | 31039 | 7.4% |
| " | 11660 | 2.8% |
| ) | 10042 | 2.4% |
| ( | 10018 | 2.4% |
| 1 | 9738 | 2.3% |
| 0 | 8262 | 2.0% |
| 9 | 6399 | 1.5% |
| Other values (65) | 37208 | 8.9% |
Cyrillic
| Value | Count | Frequency (%) |
| о | 470 | 10.2% |
| е | 404 | 8.8% |
| а | 373 | 8.1% |
| н | 323 | 7.0% |
| и | 299 | 6.5% |
| т | 265 | 5.8% |
| р | 240 | 5.2% |
| с | 218 | 4.8% |
| в | 173 | 3.8% |
| л | 161 | 3.5% |
| Other values (46) | 1661 |
Greek
| Value | Count | Frequency (%) |
| α | 60 | 9.3% |
| ο | 55 | 8.5% |
| τ | 43 | 6.6% |
| η | 36 | 5.6% |
| ι | 36 | 5.6% |
| ν | 34 | 5.2% |
| ρ | 31 | 4.8% |
| ε | 31 | 4.8% |
| π | 30 | 4.6% |
| ς | 30 | 4.6% |
| Other values (33) | 262 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 11 | 14.3% |
| न | 6 | 7.8% |
| र | 6 | 7.8% |
| म | 5 | 6.5% |
| ी | 4 | 5.2% |
| प | 3 | 3.9% |
| ् | 3 | 3.9% |
| ो | 3 | 3.9% |
| द | 3 | 3.9% |
| अ | 3 | 3.9% |
| Other values (21) | 30 |
Hiragana
| Value | Count | Frequency (%) |
| の | 4 | |
| め | 1 | 5.0% |
| さ | 1 | 5.0% |
| ん | 1 | 5.0% |
| ひ | 1 | 5.0% |
| ち | 1 | 5.0% |
| ず | 1 | 5.0% |
| か | 1 | 5.0% |
| み | 1 | 5.0% |
| け | 1 | 5.0% |
| Other values (7) | 7 |
Telugu
| Value | Count | Frequency (%) |
| ి | 4 | |
| ు | 3 | |
| ద | 3 | |
| ్ | 3 | |
| ా | 2 | 6.7% |
| ర | 2 | 6.7% |
| మ | 2 | 6.7% |
| స | 2 | 6.7% |
| న | 2 | 6.7% |
| ె | 1 | 3.3% |
| Other values (6) | 6 |
Tamil
| Value | Count | Frequency (%) |
| ் | 3 | |
| ர | 2 | |
| ு | 2 | |
| ப | 2 | |
| ம | 2 | |
| ச | 1 | 5.3% |
| ன | 1 | 5.3% |
| வ | 1 | 5.3% |
| த | 1 | 5.3% |
| ஆ | 1 | 5.3% |
| Other values (3) | 3 |
Han
| Value | Count | Frequency (%) |
| 者 | 1 | |
| 患 | 1 | |
| 水 | 1 | |
| 俣 | 1 | |
| 界 | 1 | |
| 世 | 1 | |
| 見 | 1 | |
| 鬼 | 1 | |
| 難 | 1 | |
| 海 | 1 |
Hangul
| Value | Count | Frequency (%) |
| 사 | 2 | |
| 식 | 1 | |
| 회 | 1 | |
| 주 | 1 | |
| 기 | 1 | |
| 찾 | 1 | |
| 랑 | 1 | |
| 첫 | 1 |
Thai
| Value | Count | Frequency (%) |
| ่ | 2 | |
| พ | 1 | |
| ร | 1 | |
| ง | 1 | |
| แ | 1 | |
| ส | 1 | |
| ี | 1 |
Arabic
| Value | Count | Frequency (%) |
| م | 2 | |
| ہ | 1 | |
| ت | 1 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ̈ | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11931797 | |
| Punctuation | 7252 | 0.1% |
| None | 5883 | < 0.1% |
| Cyrillic | 4587 | < 0.1% |
| Devanagari | 77 | < 0.1% |
| Telugu | 30 | < 0.1% |
| Hiragana | 20 | < 0.1% |
| Tamil | 19 | < 0.1% |
| Letterlike Symbols | 14 | < 0.1% |
| CJK | 10 | < 0.1% |
| Other values (11) | 41 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1362739 | 11.4% |
| a | 939721 | 7.9% |
| t | 934056 | 7.8% |
| i | 850842 | 7.1% |
| o | 829251 | 6.9% |
| n | 821950 | 6.9% |
| s | 767224 | 6.4% |
| r | 743645 | 6.2% |
| h | 600339 | 5.0% |
| l | 478418 | 4.0% |
| Other values (80) | 3603612 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3842 | |
| – | 881 | 12.1% |
| ” | 688 | 9.5% |
| “ | 670 | 9.2% |
| — | 633 | 8.7% |
| … | 303 | 4.2% |
| ‘ | 192 | 2.6% |
| | 31 | 0.4% |
| ― | 5 | 0.1% |
| ‐ | 4 | 0.1% |
| Other values (2) | 3 | < 0.1% |
None
| Value | Count | Frequency (%) |
| é | 1544 | |
| ä | 294 | 5.0% |
| á | 293 | 5.0% |
| ö | 250 | 4.2% |
| í | 243 | 4.1% |
| è | 209 | 3.6% |
| ü | 178 | 3.0% |
| ı | 165 | 2.8% |
| ó | 164 | 2.8% |
| ç | 158 | 2.7% |
| Other values (139) | 2385 |
Cyrillic
| Value | Count | Frequency (%) |
| о | 470 | 10.2% |
| е | 404 | 8.8% |
| а | 373 | 8.1% |
| н | 323 | 7.0% |
| и | 299 | 6.5% |
| т | 265 | 5.8% |
| р | 240 | 5.2% |
| с | 218 | 4.8% |
| в | 173 | 3.8% |
| л | 161 | 3.5% |
| Other values (46) | 1661 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 14 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 11 | 14.3% |
| न | 6 | 7.8% |
| र | 6 | 7.8% |
| म | 5 | 6.5% |
| ी | 4 | 5.2% |
| प | 3 | 3.9% |
| ् | 3 | 3.9% |
| ो | 3 | 3.9% |
| द | 3 | 3.9% |
| अ | 3 | 3.9% |
| Other values (21) | 30 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 4 | |
| ̈ | 3 |
Hiragana
| Value | Count | Frequency (%) |
| の | 4 | |
| め | 1 | 5.0% |
| さ | 1 | 5.0% |
| ん | 1 | 5.0% |
| ひ | 1 | 5.0% |
| ち | 1 | 5.0% |
| ず | 1 | 5.0% |
| か | 1 | 5.0% |
| み | 1 | 5.0% |
| け | 1 | 5.0% |
| Other values (7) | 7 |
Alphabetic PF
| Value | Count | Frequency (%) |
| fi | 4 |
Telugu
| Value | Count | Frequency (%) |
| ి | 4 | |
| ు | 3 | |
| ద | 3 | |
| ్ | 3 | |
| ా | 2 | 6.7% |
| ర | 2 | 6.7% |
| మ | 2 | 6.7% |
| స | 2 | 6.7% |
| న | 2 | 6.7% |
| ె | 1 | 3.3% |
| Other values (6) | 6 |
Tamil
| Value | Count | Frequency (%) |
| ் | 3 | |
| ர | 2 | |
| ு | 2 | |
| ப | 2 | |
| ம | 2 | |
| ச | 1 | 5.3% |
| ன | 1 | 5.3% |
| வ | 1 | 5.3% |
| த | 1 | 5.3% |
| ஆ | 1 | 5.3% |
| Other values (3) | 3 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 2 |
Hangul
| Value | Count | Frequency (%) |
| 사 | 2 | |
| 식 | 1 | |
| 회 | 1 | |
| 주 | 1 | |
| 기 | 1 | |
| 찾 | 1 | |
| 랑 | 1 | |
| 첫 | 1 |
Arabic
| Value | Count | Frequency (%) |
| م | 2 | |
| ہ | 1 | |
| ت | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 2 |
Thai
| Value | Count | Frequency (%) |
| ่ | 2 | |
| พ | 1 | |
| ร | 1 | |
| ง | 1 | |
| แ | 1 | |
| ส | 1 | |
| ี | 1 |
CJK
| Value | Count | Frequency (%) |
| 者 | 1 | |
| 患 | 1 | |
| 水 | 1 | |
| 俣 | 1 | |
| 界 | 1 | |
| 世 | 1 | |
| 見 | 1 | |
| 鬼 | 1 | |
| 難 | 1 | |
| 海 | 1 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
Specials
| Value | Count | Frequency (%) |
| � | 1 |
Katakana
| Value | Count | Frequency (%) |
| ・ | 1 |
Currency Symbols
| Value | Count | Frequency (%) |
| ₹ | 1 | |
| € | 1 |
popularity
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 43719 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.926188 |
| Minimum | 0 |
|---|---|
| Maximum | 547.4883 |
| Zeros | 40 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.020823 |
| Q1 | 0.38873225 |
| median | 1.130176 |
| Q3 | 3.6893365 |
| 95-th percentile | 11.063757 |
| Maximum | 547.4883 |
| Range | 547.4883 |
| Interquartile range (IQR) | 3.3006043 |
Descriptive statistics
| Standard deviation | 6.0109699 |
|---|---|
| Coefficient of variation (CV) | 2.0541981 |
| Kurtosis | 1923.3033 |
| Mean | 2.926188 |
| Median Absolute Deviation (MAD) | 0.967289 |
| Skewness | 29.215423 |
| Sum | 132690.92 |
| Variance | 36.131759 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 × 10-6 | 56 | 0.1% |
| 0.000308 | 42 | 0.1% |
| 0 | 40 | 0.1% |
| 0.00022 | 39 | 0.1% |
| 0.001177 | 38 | 0.1% |
| 0.000844 | 38 | 0.1% |
| 0.000578 | 38 | 0.1% |
| 0.002001 | 27 | 0.1% |
| 0.003013 | 21 | < 0.1% |
| 0.00353 | 19 | < 0.1% |
| Other values (43709) | 44988 |
| Value | Count | Frequency (%) |
| 0 | 40 | |
| 1 × 10-6 | 56 | |
| 2 × 10-6 | 6 | < 0.1% |
| 3 × 10-6 | 6 | < 0.1% |
| 4 × 10-6 | 5 | < 0.1% |
| 5 × 10-6 | 1 | < 0.1% |
| 6 × 10-6 | 2 | < 0.1% |
| 7 × 10-6 | 1 | < 0.1% |
| 8 × 10-6 | 6 | < 0.1% |
| 9 × 10-6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 547.488298 | 1 | |
| 294.337037 | 1 | |
| 287.253654 | 1 | |
| 228.032744 | 1 | |
| 213.849907 | 1 | |
| 187.860492 | 1 | |
| 185.330992 | 1 | |
| 185.070892 | 1 | |
| 183.870374 | 1 | |
| 154.801009 | 1 |
vote_average
Real number (ℝ)
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6241962 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 2944 |
| Zeros (%) | 6.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6.8 |
| 95-th percentile | 7.8 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.915339 |
|---|---|
| Coefficient of variation (CV) | 0.34055337 |
| Kurtosis | 2.5420383 |
| Mean | 5.6241962 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | -1.5243174 |
| Sum | 255034.8 |
| Variance | 3.6685234 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2944 | 6.5% |
| 6 | 2461 | 5.4% |
| 5 | 1994 | 4.4% |
| 7 | 1882 | 4.2% |
| 6.5 | 1722 | 3.8% |
| 6.3 | 1602 | 3.5% |
| 5.5 | 1381 | 3.0% |
| 5.8 | 1369 | 3.0% |
| 6.4 | 1348 | 3.0% |
| 6.7 | 1339 | 3.0% |
| Other values (82) | 27304 |
| Value | Count | Frequency (%) |
| 0 | 2944 | |
| 0.5 | 13 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 1 | 103 | 0.2% |
| 1.1 | 1 | < 0.1% |
| 1.2 | 4 | < 0.1% |
| 1.3 | 13 | < 0.1% |
| 1.4 | 5 | < 0.1% |
| 1.5 | 30 | 0.1% |
| 1.6 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 185 | |
| 9.8 | 1 | < 0.1% |
| 9.6 | 1 | < 0.1% |
| 9.5 | 18 | < 0.1% |
| 9.4 | 3 | < 0.1% |
| 9.3 | 18 | < 0.1% |
| 9.2 | 4 | < 0.1% |
| 9.1 | 2 | < 0.1% |
| 9 | 158 | |
| 8.9 | 7 | < 0.1% |
vote_count
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1820 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.13529 |
| Minimum | 0 |
|---|---|
| Maximum | 14075 |
| Zeros | 2846 |
| Zeros (%) | 6.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 10 |
| Q3 | 34 |
| 95-th percentile | 434.75 |
| Maximum | 14075 |
| Range | 14075 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 491.89928 |
|---|---|
| Coefficient of variation (CV) | 4.4663183 |
| Kurtosis | 150.83135 |
| Mean | 110.13529 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 10.437494 |
| Sum | 4994195 |
| Variance | 241964.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3240 | 7.1% |
| 2 | 3127 | 6.9% |
| 0 | 2846 | 6.3% |
| 3 | 2780 | 6.1% |
| 4 | 2477 | 5.5% |
| 5 | 2096 | 4.6% |
| 6 | 1747 | 3.9% |
| 7 | 1568 | 3.5% |
| 8 | 1359 | 3.0% |
| 9 | 1194 | 2.6% |
| Other values (1810) | 22912 |
| Value | Count | Frequency (%) |
| 0 | 2846 | |
| 1 | 3240 | |
| 2 | 3127 | |
| 3 | 2780 | |
| 4 | 2477 | |
| 5 | 2096 | |
| 6 | 1747 | |
| 7 | 1568 | |
| 8 | 1359 | |
| 9 | 1194 | 2.6% |
| Value | Count | Frequency (%) |
| 14075 | 1 | |
| 12269 | 1 | |
| 12114 | 1 | |
| 12000 | 1 | |
| 11444 | 1 | |
| 11187 | 1 | |
| 10297 | 1 | |
| 10014 | 1 | |
| 9678 | 1 | |
| 9634 | 1 |
status
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 80 |
| Missing (%) | 0.2% |
| Memory size | 354.4 KiB |
| Released | |
|---|---|
| Rumored | 229 |
| PostProduction | 97 |
| InProduction | 19 |
| Planned | 13 |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 8.0091901 |
| Min length | 7 |
Characters and Unicode
| Total characters | 362544 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Released |
|---|---|
| 2nd row | Released |
| 3rd row | Released |
| 4th row | Released |
| 5th row | Released |
Common Values
| Value | Count | Frequency (%) |
| Released | 44907 | |
| Rumored | 229 | 0.5% |
| PostProduction | 97 | 0.2% |
| InProduction | 19 | < 0.1% |
| Planned | 13 | < 0.1% |
| Canceled | 1 | < 0.1% |
| (Missing) | 80 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| released | 44907 | |
| rumored | 229 | 0.5% |
| postproduction | 97 | 0.2% |
| inproduction | 19 | < 0.1% |
| planned | 13 | < 0.1% |
| canceled | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 134965 | |
| d | 45266 | 12.5% |
| R | 45136 | 12.4% |
| s | 45004 | 12.4% |
| l | 44921 | 12.4% |
| a | 44921 | 12.4% |
| o | 558 | 0.2% |
| u | 345 | 0.1% |
| r | 345 | 0.1% |
| m | 229 | 0.1% |
| Other values (7) | 854 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 317162 | |
| Uppercase Letter | 45382 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 134965 | |
| d | 45266 | 14.3% |
| s | 45004 | 14.2% |
| l | 44921 | 14.2% |
| a | 44921 | 14.2% |
| o | 558 | 0.2% |
| u | 345 | 0.1% |
| r | 345 | 0.1% |
| m | 229 | 0.1% |
| t | 213 | 0.1% |
| Other values (3) | 395 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 45136 | |
| P | 226 | 0.5% |
| I | 19 | < 0.1% |
| C | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 362544 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 134965 | |
| d | 45266 | 12.5% |
| R | 45136 | 12.4% |
| s | 45004 | 12.4% |
| l | 44921 | 12.4% |
| a | 44921 | 12.4% |
| o | 558 | 0.2% |
| u | 345 | 0.1% |
| r | 345 | 0.1% |
| m | 229 | 0.1% |
| Other values (7) | 854 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 362544 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 134965 | |
| d | 45266 | 12.5% |
| R | 45136 | 12.4% |
| s | 45004 | 12.4% |
| l | 44921 | 12.4% |
| a | 44921 | 12.4% |
| o | 558 | 0.2% |
| u | 345 | 0.1% |
| r | 345 | 0.1% |
| m | 229 | 0.1% |
| Other values (7) | 854 | 0.2% |
original_language
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 89 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 354.4 KiB |
| en | |
|---|---|
| fr | 2435 |
| it | 1528 |
| ja | 1346 |
| de | 1077 |
| Other values (84) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 90670 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
Common Values
| Value | Count | Frequency (%) |
| en | 32184 | |
| fr | 2435 | 5.4% |
| it | 1528 | 3.4% |
| ja | 1346 | 3.0% |
| de | 1077 | 2.4% |
| es | 992 | 2.2% |
| ru | 822 | 1.8% |
| hi | 508 | 1.1% |
| ko | 444 | 1.0% |
| zh | 408 | 0.9% |
| Other values (79) | 3591 | 7.9% |
Length
| Value | Count | Frequency (%) |
| en | 32184 | |
| fr | 2435 | 5.4% |
| it | 1528 | 3.4% |
| ja | 1346 | 3.0% |
| de | 1077 | 2.4% |
| es | 992 | 2.2% |
| ru | 822 | 1.8% |
| hi | 508 | 1.1% |
| ko | 444 | 1.0% |
| zh | 408 | 0.9% |
| Other values (79) | 3591 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 34508 | |
| n | 32892 | |
| r | 3628 | 4.0% |
| f | 2830 | 3.1% |
| i | 2386 | 2.6% |
| t | 2249 | 2.5% |
| a | 1834 | 2.0% |
| s | 1651 | 1.8% |
| j | 1347 | 1.5% |
| d | 1321 | 1.5% |
| Other values (16) | 6024 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 90670 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 34508 | |
| n | 32892 | |
| r | 3628 | 4.0% |
| f | 2830 | 3.1% |
| i | 2386 | 2.6% |
| t | 2249 | 2.5% |
| a | 1834 | 2.0% |
| s | 1651 | 1.8% |
| j | 1347 | 1.5% |
| d | 1321 | 1.5% |
| Other values (16) | 6024 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 90670 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 34508 | |
| n | 32892 | |
| r | 3628 | 4.0% |
| f | 2830 | 3.1% |
| i | 2386 | 2.6% |
| t | 2249 | 2.5% |
| a | 1834 | 2.0% |
| s | 1651 | 1.8% |
| j | 1347 | 1.5% |
| d | 1321 | 1.5% |
| Other values (16) | 6024 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 90670 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 34508 | |
| n | 32892 | |
| r | 3628 | 4.0% |
| f | 2830 | 3.1% |
| i | 2386 | 2.6% |
| t | 2249 | 2.5% |
| a | 1834 | 2.0% |
| s | 1651 | 1.8% |
| j | 1347 | 1.5% |
| d | 1321 | 1.5% |
| Other values (16) | 6024 | 6.6% |
runtime
Real number (ℝ)
| Distinct | 353 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 93.666895 |
| Minimum | 0 |
|---|---|
| Maximum | 1256 |
| Zeros | 1781 |
| Zeros (%) | 3.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 85 |
| median | 95 |
| Q3 | 107 |
| 95-th percentile | 138 |
| Maximum | 1256 |
| Range | 1256 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 38.865238 |
|---|---|
| Coefficient of variation (CV) | 0.41493036 |
| Kurtosis | 88.775055 |
| Mean | 93.666895 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 4.2532768 |
| Sum | 4247419 |
| Variance | 1510.5067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 2548 | 5.6% |
| 0 | 1781 | 3.9% |
| 100 | 1470 | 3.2% |
| 95 | 1409 | 3.1% |
| 93 | 1212 | 2.7% |
| 96 | 1104 | 2.4% |
| 92 | 1078 | 2.4% |
| 94 | 1061 | 2.3% |
| 91 | 1055 | 2.3% |
| 88 | 1030 | 2.3% |
| Other values (343) | 31598 |
| Value | Count | Frequency (%) |
| 0 | 1781 | |
| 1 | 107 | 0.2% |
| 2 | 33 | 0.1% |
| 3 | 48 | 0.1% |
| 4 | 50 | 0.1% |
| 5 | 51 | 0.1% |
| 6 | 72 | 0.2% |
| 7 | 103 | 0.2% |
| 8 | 78 | 0.2% |
| 9 | 63 | 0.1% |
| Value | Count | Frequency (%) |
| 1256 | 1 | |
| 1140 | 2 | |
| 931 | 1 | |
| 925 | 1 | |
| 900 | 1 | |
| 877 | 1 | |
| 874 | 1 | |
| 840 | 2 | |
| 780 | 1 | |
| 720 | 1 |
budget
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1223 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4232579.8 |
| Minimum | 0 |
|---|---|
| Maximum | 3.8 × 108 |
| Zeros | 36470 |
| Zeros (%) | 80.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 25000000 |
| Maximum | 3.8 × 108 |
| Range | 3.8 × 108 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17443731 |
|---|---|
| Coefficient of variation (CV) | 4.1213 |
| Kurtosis | 66.618217 |
| Mean | 4232579.8 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.1180066 |
| Sum | 1.9193056 × 1011 |
| Variance | 3.0428374 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36470 | |
| 5000000 | 286 | 0.6% |
| 10000000 | 258 | 0.6% |
| 20000000 | 243 | 0.5% |
| 2000000 | 242 | 0.5% |
| 15000000 | 226 | 0.5% |
| 3000000 | 223 | 0.5% |
| 25000000 | 206 | 0.5% |
| 1000000 | 197 | 0.4% |
| 30000000 | 189 | 0.4% |
| Other values (1213) | 6806 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 36470 | |
| 1 | 25 | 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 7 | < 0.1% |
| 5 | 8 | < 0.1% |
| 6 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 380000000 | 1 | < 0.1% |
| 300000000 | 1 | < 0.1% |
| 280000000 | 1 | < 0.1% |
| 270000000 | 1 | < 0.1% |
| 260000000 | 3 | < 0.1% |
| 258000000 | 1 | < 0.1% |
| 255000000 | 1 | < 0.1% |
| 250000000 | 10 | |
| 245000000 | 2 | < 0.1% |
| 237000000 | 1 | < 0.1% |
revenue
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 6863 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11233655 |
| Minimum | 0 |
|---|---|
| Maximum | 2.7879651 × 109 |
| Zeros | 37949 |
| Zeros (%) | 83.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 48025328 |
| Maximum | 2.7879651 × 109 |
| Range | 2.7879651 × 109 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 64409896 |
|---|---|
| Coefficient of variation (CV) | 5.7336544 |
| Kurtosis | 236.93621 |
| Mean | 11233655 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.251264 |
| Sum | 5.0940133 × 1011 |
| Variance | 4.1486347 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37949 | |
| 12000000 | 20 | < 0.1% |
| 11000000 | 19 | < 0.1% |
| 10000000 | 19 | < 0.1% |
| 2000000 | 18 | < 0.1% |
| 6000000 | 17 | < 0.1% |
| 5000000 | 14 | < 0.1% |
| 8000000 | 13 | < 0.1% |
| 500000 | 13 | < 0.1% |
| 1 | 12 | < 0.1% |
| Other values (6853) | 7252 | 16.0% |
| Value | Count | Frequency (%) |
| 0 | 37949 | |
| 1 | 12 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2787965087 | 1 | |
| 2068223624 | 1 | |
| 1845034188 | 1 | |
| 1519557910 | 1 | |
| 1513528810 | 1 | |
| 1506249360 | 1 | |
| 1405403694 | 1 | |
| 1342000000 | 1 | |
| 1274219009 | 1 | |
| 1262886337 | 1 |
tagline
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 20268 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 24960 |
| Missing (%) | 55.0% |
| Memory size | 354.4 KiB |
| Basedonatruestory. | 7 |
|---|---|
| Trustnoone. | 4 |
| - | 4 |
| Becarefulwhatyouwishfor. | 4 |
| KnowYourEnemy | 3 |
| Other values (20263) |
Length
| Max length | 259 |
|---|---|
| Median length | 179 |
| Mean length | 39.464093 |
| Min length | 1 |
Characters and Unicode
| Total characters | 804515 |
|---|---|
| Distinct characters | 169 |
| Distinct categories | 16 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 10 ? |
Unique
| Unique | 20172 ? |
|---|---|
| Unique (%) | 99.0% |
Sample
| 1st row | Rollthediceandunleashtheexcitement! |
|---|---|
| 2nd row | StillYelling.StillFighting.StillReadyforLove. |
| 3rd row | Friendsarethepeoplewholetyoubeyourself...andneverletyouforgetit. |
| 4th row | JustWhenHisWorldIsBackToNormal...He'sInForTheSurpriseOfHisLife! |
| 5th row | ALosAngelesCrimeSaga |
Common Values
| Value | Count | Frequency (%) |
| Basedonatruestory. | 7 | < 0.1% |
| Trustnoone. | 4 | < 0.1% |
| - | 4 | < 0.1% |
| Becarefulwhatyouwishfor. | 4 | < 0.1% |
| KnowYourEnemy | 3 | < 0.1% |
| ClassicAlbums | 3 | < 0.1% |
| Documentary | 3 | < 0.1% |
| Howfarwouldyougo? | 3 | < 0.1% |
| WhoisJohnGalt? | 3 | < 0.1% |
| Drama | 3 | < 0.1% |
| Other values (20258) | 20349 | |
| (Missing) | 24960 |
Length
| Value | Count | Frequency (%) |
| basedonatruestory | 11 | 0.1% |
| trustnoone | 7 | < 0.1% |
| becarefulwhatyouwishfor | 7 | < 0.1% |
| alovestory | 5 | < 0.1% |
| atruestory | 5 | < 0.1% |
| knowyourenemy | 4 | < 0.1% |
| documentary | 4 | < 0.1% |
| fightfirewithfire | 4 | < 0.1% |
| 4 | < 0.1% | |
| twofilms.onelove | 3 | < 0.1% |
| Other values (20091) | 20332 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 94342 | 11.7% |
| t | 57223 | 7.1% |
| o | 56534 | 7.0% |
| a | 51450 | 6.4% |
| n | 47460 | 5.9% |
| i | 46013 | 5.7% |
| r | 44957 | 5.6% |
| s | 42345 | 5.3% |
| h | 37144 | 4.6% |
| l | 30159 | 3.7% |
| Other values (159) | 296888 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 680045 | |
| Uppercase Letter | 74965 | 9.3% |
| Other Punctuation | 44556 | 5.5% |
| Decimal Number | 2687 | 0.3% |
| Dash Punctuation | 1942 | 0.2% |
| Final Punctuation | 98 | < 0.1% |
| Open Punctuation | 56 | < 0.1% |
| Close Punctuation | 55 | < 0.1% |
| Currency Symbol | 37 | < 0.1% |
| Other Letter | 34 | < 0.1% |
| Other values (6) | 40 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 94342 | |
| t | 57223 | 8.4% |
| o | 56534 | 8.3% |
| a | 51450 | 7.6% |
| n | 47460 | 7.0% |
| i | 46013 | 6.8% |
| r | 44957 | 6.6% |
| s | 42345 | 6.2% |
| h | 37144 | 5.5% |
| l | 30159 | 4.4% |
| Other values (43) | 172418 |
Other Letter
| Value | Count | Frequency (%) |
| 劇 | 1 | 2.9% |
| ஆ | 1 | 2.9% |
| 時 | 1 | 2.9% |
| 熟 | 1 | 2.9% |
| த | 1 | 2.9% |
| வ | 1 | 2.9% |
| 成 | 1 | 2.9% |
| ன | 1 | 2.9% |
| 最 | 1 | 2.9% |
| 場 | 1 | 2.9% |
| Other values (24) | 24 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 10007 | 13.3% |
| A | 6871 | 9.2% |
| S | 5648 | 7.5% |
| H | 4401 | 5.9% |
| I | 4387 | 5.9% |
| E | 4304 | 5.7% |
| W | 3678 | 4.9% |
| O | 3476 | 4.6% |
| L | 3193 | 4.3% |
| N | 3193 | 4.3% |
| Other values (20) | 25807 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 26640 | |
| ! | 5784 | 13.0% |
| ' | 5659 | 12.7% |
| , | 4222 | 9.5% |
| ? | 1159 | 2.6% |
| " | 582 | 1.3% |
| … | 148 | 0.3% |
| : | 137 | 0.3% |
| & | 83 | 0.2% |
| * | 42 | 0.1% |
| Other values (7) | 100 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 802 | |
| 1 | 516 | |
| 2 | 299 | 11.1% |
| 3 | 208 | 7.7% |
| 9 | 208 | 7.7% |
| 5 | 168 | 6.3% |
| 4 | 140 | 5.2% |
| 6 | 121 | 4.5% |
| 7 | 121 | 4.5% |
| 8 | 104 | 3.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 | |
| = | 5 | |
| | | 2 | 14.3% |
| ~ | 1 | 7.1% |
| − | 1 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1925 | |
| – | 9 | 0.5% |
| — | 8 | 0.4% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 82 | |
| ” | 15 | 15.3% |
| » | 1 | 1.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 14 | |
| ‘ | 4 | 21.1% |
| « | 1 | 5.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 49 | |
| [ | 7 | 12.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 48 | |
| ] | 7 | 12.7% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 | |
| ² | 1 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˌ | 1 | |
| ˈ | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 37 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ் | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 755010 | |
| Common | 49470 | 6.1% |
| Han | 21 | < 0.1% |
| Tamil | 5 | < 0.1% |
| Hiragana | 5 | < 0.1% |
| Katakana | 4 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 94342 | 12.5% |
| t | 57223 | 7.6% |
| o | 56534 | 7.5% |
| a | 51450 | 6.8% |
| n | 47460 | 6.3% |
| i | 46013 | 6.1% |
| r | 44957 | 6.0% |
| s | 42345 | 5.6% |
| h | 37144 | 4.9% |
| l | 30159 | 4.0% |
| Other values (73) | 247383 |
Common
| Value | Count | Frequency (%) |
| . | 26640 | |
| ! | 5784 | 11.7% |
| ' | 5659 | 11.4% |
| , | 4222 | 8.5% |
| - | 1925 | 3.9% |
| ? | 1159 | 2.3% |
| 0 | 802 | 1.6% |
| " | 582 | 1.2% |
| 1 | 516 | 1.0% |
| 2 | 299 | 0.6% |
| Other values (41) | 1882 | 3.8% |
Han
| Value | Count | Frequency (%) |
| 劇 | 1 | 4.8% |
| 時 | 1 | 4.8% |
| 熟 | 1 | 4.8% |
| 成 | 1 | 4.8% |
| 最 | 1 | 4.8% |
| 場 | 1 | 4.8% |
| 版 | 1 | 4.8% |
| 蜜 | 1 | 4.8% |
| 舞 | 1 | 4.8% |
| 的 | 1 | 4.8% |
| Other values (11) | 11 |
Tamil
| Value | Count | Frequency (%) |
| ஆ | 1 | |
| த | 1 | |
| வ | 1 | |
| ன | 1 | |
| ் | 1 |
Hiragana
| Value | Count | Frequency (%) |
| は | 1 | |
| し | 1 | |
| て | 1 | |
| い | 1 | |
| る | 1 |
Katakana
| Value | Count | Frequency (%) |
| ク | 1 | |
| ラ | 1 | |
| ナ | 1 | |
| ド | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 804086 | |
| Punctuation | 280 | < 0.1% |
| None | 109 | < 0.1% |
| CJK | 21 | < 0.1% |
| Tamil | 5 | < 0.1% |
| Hiragana | 5 | < 0.1% |
| Katakana | 4 | < 0.1% |
| IPA Ext | 2 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
| Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 94342 | 11.7% |
| t | 57223 | 7.1% |
| o | 56534 | 7.0% |
| a | 51450 | 6.4% |
| n | 47460 | 5.9% |
| i | 46013 | 5.7% |
| r | 44957 | 5.6% |
| s | 42345 | 5.3% |
| h | 37144 | 4.6% |
| l | 30159 | 3.8% |
| Other values (77) | 296459 |
Punctuation
| Value | Count | Frequency (%) |
| … | 148 | |
| ’ | 82 | |
| ” | 15 | 5.4% |
| “ | 14 | 5.0% |
| – | 9 | 3.2% |
| — | 8 | 2.9% |
| ‘ | 4 | 1.4% |
None
| Value | Count | Frequency (%) |
| é | 17 | |
| ä | 16 | |
| ö | 8 | 7.3% |
| ó | 6 | 5.5% |
| á | 6 | 5.5% |
| ü | 5 | 4.6% |
| ı | 5 | 4.6% |
| í | 5 | 4.6% |
| · | 4 | 3.7% |
| ñ | 3 | 2.8% |
| Other values (26) | 34 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 2 |
CJK
| Value | Count | Frequency (%) |
| 劇 | 1 | 4.8% |
| 時 | 1 | 4.8% |
| 熟 | 1 | 4.8% |
| 成 | 1 | 4.8% |
| 最 | 1 | 4.8% |
| 場 | 1 | 4.8% |
| 版 | 1 | 4.8% |
| 蜜 | 1 | 4.8% |
| 舞 | 1 | 4.8% |
| 的 | 1 | 4.8% |
| Other values (11) | 11 |
Tamil
| Value | Count | Frequency (%) |
| ஆ | 1 | |
| த | 1 | |
| வ | 1 | |
| ன | 1 | |
| ் | 1 |
Katakana
| Value | Count | Frequency (%) |
| ク | 1 | |
| ラ | 1 | |
| ナ | 1 | |
| ド | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˌ | 1 | |
| ˈ | 1 |
Hiragana
| Value | Count | Frequency (%) |
| は | 1 | |
| し | 1 | |
| て | 1 | |
| い | 1 | |
| る | 1 |
Math Operators
| Value | Count | Frequency (%) |
| − | 1 |
id_btc
Real number (ℝ)
| Distinct | 1078 |
|---|---|
| Distinct (%) | 34.1% |
| Missing | 42183 |
| Missing (%) | 93.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 158900.63 |
| Minimum | 10 |
|---|---|
| Maximum | 479888 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 1617 |
| Q1 | 44913 |
| median | 115822 |
| Q3 | 253474.5 |
| 95-th percentile | 425164 |
| Maximum | 479888 |
| Range | 479878 |
| Interquartile range (IQR) | 208561.5 |
Descriptive statistics
| Standard deviation | 136342.15 |
|---|---|
| Coefficient of variation (CV) | 0.85803401 |
| Kurtosis | -0.50880722 |
| Mean | 158900.63 |
| Median Absolute Deviation (MAD) | 91105 |
| Skewness | 0.7806269 |
| Sum | 5.0260271 × 108 |
| Variance | 1.8589182 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 415931 | 29 | 0.1% |
| 421566 | 27 | 0.1% |
| 96887 | 26 | 0.1% |
| 645 | 26 | 0.1% |
| 37261 | 25 | 0.1% |
| 34055 | 20 | < 0.1% |
| 374509 | 16 | < 0.1% |
| 38451 | 15 | < 0.1% |
| 425164 | 15 | < 0.1% |
| 19163 | 14 | < 0.1% |
| Other values (1068) | 2950 | 6.5% |
| (Missing) | 42183 |
| Value | Count | Frequency (%) |
| 10 | 8 | |
| 84 | 4 | |
| 119 | 3 | < 0.1% |
| 131 | 3 | < 0.1% |
| 151 | 6 | |
| 230 | 3 | < 0.1% |
| 263 | 3 | < 0.1% |
| 264 | 3 | < 0.1% |
| 295 | 5 | |
| 328 | 4 |
| Value | Count | Frequency (%) |
| 479888 | 2 | < 0.1% |
| 479549 | 1 | < 0.1% |
| 478947 | 2 | < 0.1% |
| 478628 | 12 | |
| 478442 | 1 | < 0.1% |
| 476066 | 1 | < 0.1% |
| 476065 | 2 | < 0.1% |
| 476063 | 2 | < 0.1% |
| 476056 | 2 | < 0.1% |
| 476054 | 2 | < 0.1% |
name_btc
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 1078 |
|---|---|
| Distinct (%) | 34.1% |
| Missing | 42183 |
| Missing (%) | 93.0% |
| Memory size | 354.4 KiB |
| TheBoweryBoys | 29 |
|---|---|
| TotòCollection | 27 |
| Zatôichi:TheBlindSwordsman | 26 |
| JamesBondCollection | 26 |
| TheCarryOnCollection | 25 |
| Other values (1073) |
Length
| Max length | 45 |
|---|---|
| Median length | 37 |
| Mean length | 21.648435 |
| Min length | 7 |
Characters and Unicode
| Total characters | 68474 |
|---|---|
| Distinct characters | 135 |
| Distinct categories | 11 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 7 ? |
Unique
| Unique | 170 ? |
|---|---|
| Unique (%) | 5.4% |
Sample
| 1st row | ToyStoryCollection |
|---|---|
| 2nd row | GrumpyOldMenCollection |
| 3rd row | FatheroftheBrideCollection |
| 4th row | JamesBondCollection |
| 5th row | BaltoCollection |
Common Values
| Value | Count | Frequency (%) |
| TheBoweryBoys | 29 | 0.1% |
| TotòCollection | 27 | 0.1% |
| Zatôichi:TheBlindSwordsman | 26 | 0.1% |
| JamesBondCollection | 26 | 0.1% |
| TheCarryOnCollection | 25 | 0.1% |
| PokémonCollection | 20 | < 0.1% |
| Godzilla(Showa)Collection | 16 | < 0.1% |
| CharlieChan(WarnerOland)Collection | 15 | < 0.1% |
| DragonBallZ(Movie)Collection | 15 | < 0.1% |
| TheLandBeforeTimeCollection | 14 | < 0.1% |
| Other values (1068) | 2950 | 6.5% |
| (Missing) | 42183 |
Length
| Value | Count | Frequency (%) |
| theboweryboys | 29 | 0.9% |
| totòcollection | 27 | 0.9% |
| zatôichi:theblindswordsman | 26 | 0.8% |
| jamesbondcollection | 26 | 0.8% |
| thecarryoncollection | 25 | 0.8% |
| pokémoncollection | 20 | 0.6% |
| godzilla(showa)collection | 16 | 0.5% |
| charliechan(warneroland)collection | 15 | 0.5% |
| dragonballz(movie)collection | 15 | 0.5% |
| thelandbeforetimecollection | 14 | 0.4% |
| Other values (1068) | 2950 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 7970 | |
| e | 7434 | |
| l | 7290 | |
| n | 5396 | 7.9% |
| i | 5388 | 7.9% |
| t | 4726 | 6.9% |
| c | 3467 | 5.1% |
| C | 3181 | 4.6% |
| a | 3088 | 4.5% |
| r | 2715 | 4.0% |
| Other values (125) | 17819 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57455 | |
| Uppercase Letter | 9808 | 14.3% |
| Other Punctuation | 332 | 0.5% |
| Open Punctuation | 247 | 0.4% |
| Close Punctuation | 247 | 0.4% |
| Decimal Number | 242 | 0.4% |
| Dash Punctuation | 108 | 0.2% |
| Other Letter | 27 | < 0.1% |
| Final Punctuation | 3 | < 0.1% |
| Modifier Letter | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7970 | |
| e | 7434 | |
| l | 7290 | |
| n | 5396 | |
| i | 5388 | |
| t | 4726 | |
| c | 3467 | |
| a | 3088 | 5.4% |
| r | 2715 | 4.7% |
| s | 1797 | 3.1% |
| Other values (53) | 8184 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3181 | |
| T | 1050 | 10.7% |
| S | 746 | 7.6% |
| B | 490 | 5.0% |
| M | 443 | 4.5% |
| A | 393 | 4.0% |
| D | 370 | 3.8% |
| H | 362 | 3.7% |
| P | 305 | 3.1% |
| G | 289 | 2.9% |
| Other values (25) | 2179 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 61 | |
| 3 | 44 | |
| 9 | 44 | |
| 0 | 39 | |
| 2 | 18 | 7.4% |
| 8 | 12 | 5.0% |
| 5 | 10 | 4.1% |
| 6 | 6 | 2.5% |
| 7 | 6 | 2.5% |
| 4 | 2 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 112 | |
| : | 83 | |
| , | 50 | |
| & | 44 | 13.3% |
| ! | 18 | 5.4% |
| / | 17 | 5.1% |
| … | 3 | 0.9% |
| ? | 3 | 0.9% |
| * | 2 | 0.6% |
Other Letter
| Value | Count | Frequency (%) |
| ズ | 3 | |
| 男 | 3 | |
| は | 3 | |
| つ | 3 | |
| ら | 3 | |
| い | 3 | |
| よ | 3 | |
| シ | 3 | |
| リ | 3 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 243 | |
| [ | 4 | 1.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 243 | |
| ] | 4 | 1.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 106 | |
| – | 2 | 1.9% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 3 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67187 | |
| Common | 1184 | 1.7% |
| Cyrillic | 76 | 0.1% |
| Hiragana | 15 | < 0.1% |
| Katakana | 9 | < 0.1% |
| Han | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 7970 | |
| e | 7434 | |
| l | 7290 | |
| n | 5396 | 8.0% |
| i | 5388 | 8.0% |
| t | 4726 | 7.0% |
| c | 3467 | 5.2% |
| C | 3181 | 4.7% |
| a | 3088 | 4.6% |
| r | 2715 | 4.0% |
| Other values (65) | 16532 |
Common
| Value | Count | Frequency (%) |
| ( | 243 | |
| ) | 243 | |
| . | 112 | |
| - | 106 | |
| : | 83 | 7.0% |
| 1 | 61 | 5.2% |
| , | 50 | 4.2% |
| & | 44 | 3.7% |
| 3 | 44 | 3.7% |
| 9 | 44 | 3.7% |
| Other values (18) | 154 |
Cyrillic
| Value | Count | Frequency (%) |
| л | 8 | 10.5% |
| о | 8 | 10.5% |
| и | 7 | 9.2% |
| к | 7 | 9.2% |
| а | 6 | 7.9% |
| р | 5 | 6.6% |
| е | 5 | 6.6% |
| я | 4 | 5.3% |
| ц | 3 | 3.9% |
| К | 3 | 3.9% |
| Other values (13) | 20 |
Hiragana
| Value | Count | Frequency (%) |
| は | 3 | |
| つ | 3 | |
| ら | 3 | |
| い | 3 | |
| よ | 3 |
Katakana
| Value | Count | Frequency (%) |
| ズ | 3 | |
| シ | 3 | |
| リ | 3 |
Han
| Value | Count | Frequency (%) |
| 男 | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68203 | |
| None | 157 | 0.2% |
| Cyrillic | 76 | 0.1% |
| Hiragana | 15 | < 0.1% |
| Katakana | 12 | < 0.1% |
| Punctuation | 8 | < 0.1% |
| CJK | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 7970 | |
| e | 7434 | |
| l | 7290 | |
| n | 5396 | 7.9% |
| i | 5388 | 7.9% |
| t | 4726 | 6.9% |
| c | 3467 | 5.1% |
| C | 3181 | 4.7% |
| a | 3088 | 4.5% |
| r | 2715 | 4.0% |
| Other values (65) | 17548 |
None
| Value | Count | Frequency (%) |
| ô | 29 | |
| é | 28 | |
| ò | 27 | |
| ä | 16 | |
| ı | 14 | |
| ö | 11 | 7.0% |
| í | 5 | 3.2% |
| İ | 4 | 2.5% |
| Ç | 2 | 1.3% |
| ü | 2 | 1.3% |
| Other values (14) | 19 |
Cyrillic
| Value | Count | Frequency (%) |
| л | 8 | 10.5% |
| о | 8 | 10.5% |
| и | 7 | 9.2% |
| к | 7 | 9.2% |
| а | 6 | 7.9% |
| р | 5 | 6.6% |
| е | 5 | 6.6% |
| я | 4 | 5.3% |
| ц | 3 | 3.9% |
| К | 3 | 3.9% |
| Other values (13) | 20 |
Katakana
| Value | Count | Frequency (%) |
| ズ | 3 | |
| ー | 3 | |
| シ | 3 | |
| リ | 3 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 | |
| … | 3 | |
| – | 2 |
CJK
| Value | Count | Frequency (%) |
| 男 | 3 |
Hiragana
| Value | Count | Frequency (%) |
| は | 3 | |
| つ | 3 | |
| ら | 3 | |
| い | 3 | |
| よ | 3 |
poster_btc
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 1078 |
|---|---|
| Distinct (%) | 34.1% |
| Missing | 42183 |
| Missing (%) | 93.0% |
| Memory size | 354.4 KiB |
| /q6sA4bzMT9cK7EEmXYwt7PNrL5h.jpg | 29 |
|---|---|
| /4ayJsjC3djGwU9eCWUokdBWvdLC.jpg | 27 |
| /8Q31DAtmFJjhFTwQGXghBUCgWK2.jpg | 26 |
| /HORpg5CSkmeQlAolx3bKMrKgfi.jpg | 26 |
| /2P0HNrYgKDvirV8RCdT1rBSJdbJ.jpg | 25 |
| Other values (1073) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 31.957951 |
| Min length | 31 |
Characters and Unicode
| Total characters | 101083 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 170 ? |
|---|---|
| Unique (%) | 5.4% |
Sample
| 1st row | /7G9915LfUQ2lVfwMEEhDsn3kT4B.jpg |
|---|---|
| 2nd row | /nLvUdqgPgm3F85NMCii9gVFUcet.jpg |
| 3rd row | /nts4iOmNnq7GNicycMJ9pSAn204.jpg |
| 4th row | /HORpg5CSkmeQlAolx3bKMrKgfi.jpg |
| 5th row | /w0ZgH6Lgxt2bQYnf1ss74UvYftm.jpg |
Common Values
| Value | Count | Frequency (%) |
| /q6sA4bzMT9cK7EEmXYwt7PNrL5h.jpg | 29 | 0.1% |
| /4ayJsjC3djGwU9eCWUokdBWvdLC.jpg | 27 | 0.1% |
| /8Q31DAtmFJjhFTwQGXghBUCgWK2.jpg | 26 | 0.1% |
| /HORpg5CSkmeQlAolx3bKMrKgfi.jpg | 26 | 0.1% |
| /2P0HNrYgKDvirV8RCdT1rBSJdbJ.jpg | 25 | 0.1% |
| /j5te0YNZAMXDBnsqTUDKIBEt8iu.jpg | 20 | < 0.1% |
| /scvwS6k8gIW8w24UcmePQqVL10l.jpg | 16 | < 0.1% |
| /eSDdu6pbocmayu1SXQFU9VYYoQ6.jpg | 15 | < 0.1% |
| /2VMZ1zRFPnUQtQp5K4WRXvDYBjh.jpg | 15 | < 0.1% |
| /n1bjdBVThBezxR6nEf2dy43sTtV.jpg | 14 | < 0.1% |
| Other values (1068) | 2950 | 6.5% |
| (Missing) | 42183 |
Length
| Value | Count | Frequency (%) |
| q6sa4bzmt9ck7eemxywt7pnrl5h.jpg | 29 | 0.9% |
| 4ayjsjc3djgwu9ecwuokdbwvdlc.jpg | 27 | 0.9% |
| 8q31datmfjjhftwqgxghbucgwk2.jpg | 26 | 0.8% |
| horpg5cskmeqlaolx3bkmrkgfi.jpg | 26 | 0.8% |
| 2p0hnrygkdvirv8rcdt1rbsjdbj.jpg | 25 | 0.8% |
| j5te0ynzamxdbnsqtudkibet8iu.jpg | 20 | 0.6% |
| scvws6k8giw8w24ucmepqqvl10l.jpg | 16 | 0.5% |
| esddu6pbocmayu1sxqfu9vyyoq6.jpg | 15 | 0.5% |
| 2vmz1zrfpnuqtqp5k4wrxvdybjh.jpg | 15 | 0.5% |
| n1bjdbvthbezxr6nef2dy43sttv.jpg | 14 | 0.4% |
| Other values (1068) | 2950 |
Most occurring characters
| Value | Count | Frequency (%) |
| g | 4776 | 4.7% |
| p | 4566 | 4.5% |
| j | 4484 | 4.4% |
| / | 3163 | 3.1% |
| . | 3163 | 3.1% |
| m | 1591 | 1.6% |
| d | 1556 | 1.5% |
| C | 1530 | 1.5% |
| 5 | 1522 | 1.5% |
| k | 1512 | 1.5% |
| Other values (54) | 73220 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46019 | |
| Uppercase Letter | 34963 | |
| Decimal Number | 13775 | 13.6% |
| Other Punctuation | 6326 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| g | 4776 | 10.4% |
| p | 4566 | 9.9% |
| j | 4484 | 9.7% |
| m | 1591 | 3.5% |
| d | 1556 | 3.4% |
| k | 1512 | 3.3% |
| c | 1483 | 3.2% |
| i | 1474 | 3.2% |
| f | 1466 | 3.2% |
| l | 1460 | 3.2% |
| Other values (16) | 21651 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1530 | 4.4% |
| U | 1488 | 4.3% |
| Q | 1462 | 4.2% |
| F | 1425 | 4.1% |
| D | 1413 | 4.0% |
| K | 1412 | 4.0% |
| Y | 1404 | 4.0% |
| S | 1403 | 4.0% |
| J | 1389 | 4.0% |
| X | 1386 | 4.0% |
| Other values (16) | 20651 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1522 | |
| 2 | 1429 | |
| 1 | 1404 | |
| 4 | 1394 | |
| 9 | 1383 | |
| 3 | 1370 | |
| 7 | 1360 | |
| 6 | 1340 | |
| 8 | 1339 | |
| 0 | 1234 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3163 | |
| . | 3163 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 80982 | |
| Common | 20101 | 19.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| g | 4776 | 5.9% |
| p | 4566 | 5.6% |
| j | 4484 | 5.5% |
| m | 1591 | 2.0% |
| d | 1556 | 1.9% |
| C | 1530 | 1.9% |
| k | 1512 | 1.9% |
| U | 1488 | 1.8% |
| c | 1483 | 1.8% |
| i | 1474 | 1.8% |
| Other values (42) | 56522 |
Common
| Value | Count | Frequency (%) |
| / | 3163 | |
| . | 3163 | |
| 5 | 1522 | |
| 2 | 1429 | |
| 1 | 1404 | |
| 4 | 1394 | |
| 9 | 1383 | |
| 3 | 1370 | |
| 7 | 1360 | |
| 6 | 1340 | |
| Other values (2) | 2573 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 101083 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| g | 4776 | 4.7% |
| p | 4566 | 4.5% |
| j | 4484 | 4.4% |
| / | 3163 | 3.1% |
| . | 3163 | 3.1% |
| m | 1591 | 1.6% |
| d | 1556 | 1.5% |
| C | 1530 | 1.5% |
| 5 | 1522 | 1.5% |
| k | 1512 | 1.5% |
| Other values (54) | 73220 |
backdrop_btc
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 1077 |
|---|---|
| Distinct (%) | 34.0% |
| Missing | 42183 |
| Missing (%) | 93.0% |
| Memory size | 354.4 KiB |
| /foe3kuiJmg5AklhtD3skWbaTMf2.jpg | 29 |
|---|---|
| /jaUuprubvAxXLAY5hUfrNjxccUh.jpg | 27 |
| /bY8gLImMR5Pr9PaG3ZpobfaAQ8N.jpg | 26 |
| /6VcVl48kNKvdXOZfJPdarlUGOsk.jpg | 26 |
| /38tF1LJN7ULeZAuAfP7beaPMfcl.jpg | 25 |
| Other values (1072) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 31.976288 |
| Min length | 31 |
Characters and Unicode
| Total characters | 101141 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 170 ? |
|---|---|
| Unique (%) | 5.4% |
Sample
| 1st row | /9FBwqcd9IRruEDUrTdcaafOMKUq.jpg |
|---|---|
| 2nd row | /hypTnLot2z8wpFS7qwsQHW1uV8u.jpg |
| 3rd row | /7qwE57OVZmMJChBpLEbJEmzUydk.jpg |
| 4th row | /6VcVl48kNKvdXOZfJPdarlUGOsk.jpg |
| 5th row | /9VM5LiJV0bGb1st1KyHA3cVnO2G.jpg |
Common Values
| Value | Count | Frequency (%) |
| /foe3kuiJmg5AklhtD3skWbaTMf2.jpg | 29 | 0.1% |
| /jaUuprubvAxXLAY5hUfrNjxccUh.jpg | 27 | 0.1% |
| /bY8gLImMR5Pr9PaG3ZpobfaAQ8N.jpg | 26 | 0.1% |
| /6VcVl48kNKvdXOZfJPdarlUGOsk.jpg | 26 | 0.1% |
| /38tF1LJN7ULeZAuAfP7beaPMfcl.jpg | 25 | 0.1% |
| /iGoYKA0TFfgSoZpG2u5viTJMGfK.jpg | 20 | < 0.1% |
| /dx9YSup5zEOjxYwG4UkYBVAZIXo.jpg | 16 | < 0.1% |
| /9bE62qBanBFtoiIc9cXjk1xW3w.jpg | 15 | < 0.1% |
| /7PcbijxTfwi9vjWEfXdS0ReAw8q.jpg | 15 | < 0.1% |
| /alkvR9vTtuZEmd5ygsayOfxYOMa.jpg | 14 | < 0.1% |
| Other values (1067) | 2950 | 6.5% |
| (Missing) | 42183 |
Length
| Value | Count | Frequency (%) |
| foe3kuijmg5aklhtd3skwbatmf2.jpg | 29 | 0.9% |
| jauuprubvaxxlay5hufrnjxccuh.jpg | 27 | 0.9% |
| by8glimmr5pr9pag3zpobfaaq8n.jpg | 26 | 0.8% |
| 6vcvl48knkvdxozfjpdarlugosk.jpg | 26 | 0.8% |
| 38tf1ljn7ulezauafp7beapmfcl.jpg | 25 | 0.8% |
| igoyka0tffgsozpg2u5vitjmgfk.jpg | 20 | 0.6% |
| dx9ysup5zeojxywg4ukybvazixo.jpg | 16 | 0.5% |
| 9be62qbanbftoiic9cxjk1xw3w.jpg | 15 | 0.5% |
| 7pcbijxtfwi9vjwefxds0reaw8q.jpg | 15 | 0.5% |
| alkvr9vttuzemd5ygsayofxyoma.jpg | 14 | 0.4% |
| Other values (1067) | 2950 |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 4574 | 4.5% |
| j | 4560 | 4.5% |
| g | 4546 | 4.5% |
| / | 3163 | 3.1% |
| . | 3163 | 3.1% |
| k | 1685 | 1.7% |
| c | 1664 | 1.6% |
| f | 1616 | 1.6% |
| u | 1541 | 1.5% |
| 8 | 1530 | 1.5% |
| Other values (54) | 73099 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45998 | |
| Uppercase Letter | 34678 | |
| Decimal Number | 14139 | 14.0% |
| Other Punctuation | 6326 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 4574 | 9.9% |
| j | 4560 | 9.9% |
| g | 4546 | 9.9% |
| k | 1685 | 3.7% |
| c | 1664 | 3.6% |
| f | 1616 | 3.5% |
| u | 1541 | 3.4% |
| a | 1512 | 3.3% |
| i | 1470 | 3.2% |
| b | 1470 | 3.2% |
| Other values (16) | 21360 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1485 | 4.3% |
| Z | 1482 | 4.3% |
| T | 1450 | 4.2% |
| U | 1427 | 4.1% |
| Y | 1409 | 4.1% |
| N | 1408 | 4.1% |
| K | 1405 | 4.1% |
| M | 1396 | 4.0% |
| L | 1383 | 4.0% |
| G | 1382 | 4.0% |
| Other values (16) | 20451 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1530 | |
| 9 | 1492 | |
| 5 | 1452 | |
| 7 | 1432 | |
| 2 | 1426 | |
| 1 | 1421 | |
| 3 | 1405 | |
| 0 | 1399 | |
| 6 | 1330 | |
| 4 | 1252 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3163 | |
| . | 3163 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 80676 | |
| Common | 20465 | 20.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| p | 4574 | 5.7% |
| j | 4560 | 5.7% |
| g | 4546 | 5.6% |
| k | 1685 | 2.1% |
| c | 1664 | 2.1% |
| f | 1616 | 2.0% |
| u | 1541 | 1.9% |
| a | 1512 | 1.9% |
| A | 1485 | 1.8% |
| Z | 1482 | 1.8% |
| Other values (42) | 56011 |
Common
| Value | Count | Frequency (%) |
| / | 3163 | |
| . | 3163 | |
| 8 | 1530 | |
| 9 | 1492 | |
| 5 | 1452 | |
| 7 | 1432 | |
| 2 | 1426 | |
| 1 | 1421 | |
| 3 | 1405 | |
| 0 | 1399 | |
| Other values (2) | 2582 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 101141 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| p | 4574 | 4.5% |
| j | 4560 | 4.5% |
| g | 4546 | 4.5% |
| / | 3163 | 3.1% |
| . | 3163 | 3.1% |
| k | 1685 | 1.7% |
| c | 1664 | 1.6% |
| f | 1616 | 1.6% |
| u | 1541 | 1.5% |
| 8 | 1530 | 1.5% |
| Other values (54) | 73099 |
iso_639_1
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 1916 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 3792 |
| Missing (%) | 8.4% |
| Memory size | 354.4 KiB |
| en | |
|---|---|
| fr | 1850 |
| ja | 1287 |
| it | 1217 |
| es | 901 |
| Other values (1911) |
Length
| Max length | 38 |
|---|---|
| Median length | 2 |
| Mean length | 2.8379699 |
| Min length | 2 |
Characters and Unicode
| Total characters | 117929 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1358 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en,fr |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
Common Values
| Value | Count | Frequency (%) |
| en | 22366 | |
| fr | 1850 | 4.1% |
| ja | 1287 | 2.8% |
| it | 1217 | 2.7% |
| es | 901 | 2.0% |
| ru | 807 | 1.8% |
| de | 760 | 1.7% |
| en,fr | 681 | 1.5% |
| en,es | 572 | 1.3% |
| hi | 480 | 1.1% |
| Other values (1906) | 10633 | |
| (Missing) | 3792 | 8.4% |
Length
| Value | Count | Frequency (%) |
| en | 22366 | |
| fr | 1850 | 4.5% |
| ja | 1287 | 3.1% |
| it | 1217 | 2.9% |
| es | 901 | 2.2% |
| ru | 807 | 1.9% |
| de | 760 | 1.8% |
| en,fr | 681 | 1.6% |
| en,es | 572 | 1.4% |
| hi | 480 | 1.2% |
| Other values (1906) | 10633 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 34323 | |
| n | 29775 | |
| , | 11607 | 9.8% |
| r | 6717 | 5.7% |
| f | 4729 | 4.0% |
| i | 3684 | 3.1% |
| t | 3680 | 3.1% |
| s | 3621 | 3.1% |
| d | 2983 | 2.5% |
| a | 2944 | 2.5% |
| Other values (17) | 13866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 106322 | |
| Other Punctuation | 11607 | 9.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 34323 | |
| n | 29775 | |
| r | 6717 | 6.3% |
| f | 4729 | 4.4% |
| i | 3684 | 3.5% |
| t | 3680 | 3.5% |
| s | 3621 | 3.4% |
| d | 2983 | 2.8% |
| a | 2944 | 2.8% |
| h | 2351 | 2.2% |
| Other values (16) | 11515 | 10.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 11607 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 106322 | |
| Common | 11607 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 34323 | |
| n | 29775 | |
| r | 6717 | 6.3% |
| f | 4729 | 4.4% |
| i | 3684 | 3.5% |
| t | 3680 | 3.5% |
| s | 3621 | 3.4% |
| d | 2983 | 2.8% |
| a | 2944 | 2.8% |
| h | 2351 | 2.2% |
| Other values (16) | 11515 | 10.8% |
Common
| Value | Count | Frequency (%) |
| , | 11607 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117929 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 34323 | |
| n | 29775 | |
| , | 11607 | 9.8% |
| r | 6717 | 5.7% |
| f | 4729 | 4.0% |
| i | 3684 | 3.1% |
| t | 3680 | 3.1% |
| s | 3621 | 3.1% |
| d | 2983 | 2.5% |
| a | 2944 | 2.5% |
| Other values (17) | 13866 |
language_name
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 1827 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 3915 |
| Missing (%) | 8.6% |
| Memory size | 354.4 KiB |
| English | |
|---|---|
| Français | 1850 |
| 日本語 | 1287 |
| Italiano | 1217 |
| Español | 901 |
| Other values (1822) |
Length
| Max length | 101 |
|---|---|
| Median length | 7 |
| Mean length | 9.0720234 |
| Min length | 1 |
Characters and Unicode
| Total characters | 375863 |
|---|---|
| Distinct characters | 169 |
| Distinct categories | 6 ? |
| Distinct scripts | 15 ? |
| Distinct blocks | 16 ? |
Unique
| Unique | 1285 ? |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | English |
|---|---|
| 2nd row | English,Français |
| 3rd row | English |
| 4th row | English |
| 5th row | English |
Common Values
| Value | Count | Frequency (%) |
| English | 22366 | |
| Français | 1850 | 4.1% |
| 日本語 | 1287 | 2.8% |
| Italiano | 1217 | 2.7% |
| Español | 901 | 2.0% |
| Pусский | 807 | 1.8% |
| Deutsch | 760 | 1.7% |
| English,Français | 681 | 1.5% |
| English,Español | 572 | 1.3% |
| हिन्दी | 480 | 1.1% |
| Other values (1817) | 10510 | |
| (Missing) | 3915 | 8.6% |
Length
| Value | Count | Frequency (%) |
| english | 22447 | |
| français | 1857 | 4.5% |
| 日本語 | 1288 | 3.1% |
| italiano | 1219 | 2.9% |
| español | 912 | 2.2% |
| pусский | 813 | 2.0% |
| deutsch | 764 | 1.8% |
| english,français | 689 | 1.7% |
| english,español | 576 | 1.4% |
| हिन्दी | 488 | 1.2% |
| Other values (1705) | 10378 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 42209 | |
| n | 37415 | 10.0% |
| i | 36983 | 9.8% |
| l | 34590 | 9.2% |
| h | 31428 | 8.4% |
| E | 31167 | 8.3% |
| g | 30383 | 8.1% |
| a | 18889 | 5.0% |
| , | 11607 | 3.1% |
| o | 7038 | 1.9% |
| Other values (159) | 94154 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 291316 | |
| Uppercase Letter | 46331 | 12.3% |
| Other Letter | 22160 | 5.9% |
| Other Punctuation | 12672 | 3.4% |
| Spacing Mark | 1836 | 0.5% |
| Nonspacing Mark | 1548 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 42209 | |
| n | 37415 | |
| i | 36983 | |
| l | 34590 | |
| h | 31428 | |
| g | 30383 | |
| a | 18889 | |
| o | 7038 | 2.4% |
| r | 6115 | 2.1% |
| t | 5943 | 2.0% |
| Other values (63) | 40323 |
Other Letter
| Value | Count | Frequency (%) |
| 語 | 1755 | 7.9% |
| 本 | 1755 | 7.9% |
| 日 | 1755 | 7.9% |
| 话 | 1263 | 5.7% |
| 州 | 946 | 4.3% |
| 通 | 790 | 3.6% |
| 普 | 790 | 3.6% |
| न | 706 | 3.2% |
| द | 706 | 3.2% |
| ह | 706 | 3.2% |
| Other values (46) | 10988 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 31167 | |
| F | 4189 | 9.0% |
| D | 2921 | 6.3% |
| P | 2662 | 5.7% |
| I | 2364 | 5.1% |
| N | 826 | 1.8% |
| L | 479 | 1.0% |
| M | 360 | 0.8% |
| T | 307 | 0.7% |
| Č | 281 | 0.6% |
| Other values (13) | 775 | 1.7% |
Spacing Mark
| Value | Count | Frequency (%) |
| ि | 706 | |
| ी | 706 | |
| ు | 136 | 7.4% |
| ி | 111 | 6.0% |
| া | 94 | 5.1% |
| ং | 47 | 2.6% |
| ਾ | 18 | 1.0% |
| ੀ | 18 | 1.0% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ् | 706 | |
| ִ | 430 | |
| ְ | 215 | 13.9% |
| ் | 111 | 7.2% |
| ె | 68 | 4.4% |
| ੰ | 18 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 11607 | |
| / | 1015 | 8.0% |
| ? | 50 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 325339 | |
| Common | 12672 | 3.4% |
| Han | 10473 | 2.8% |
| Cyrillic | 10381 | 2.8% |
| Devanagari | 4236 | 1.1% |
| Arabic | 3332 | 0.9% |
| Hangul | 3252 | 0.9% |
| Hebrew | 1720 | 0.5% |
| Greek | 1696 | 0.5% |
| Thai | 1225 | 0.3% |
| Other values (5) | 1537 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 42209 | |
| n | 37415 | |
| i | 36983 | |
| l | 34590 | |
| h | 31428 | |
| E | 31167 | |
| g | 30383 | |
| a | 18889 | 5.8% |
| o | 7038 | 2.2% |
| r | 6115 | 1.9% |
| Other values (50) | 49122 |
Cyrillic
| Value | Count | Frequency (%) |
| с | 3190 | |
| к | 1722 | |
| и | 1667 | |
| й | 1605 | |
| у | 1554 | |
| а | 112 | 1.1% |
| р | 86 | 0.8% |
| ь | 53 | 0.5% |
| ї | 53 | 0.5% |
| н | 53 | 0.5% |
| Other values (12) | 286 | 2.8% |
Arabic
| Value | Count | Frequency (%) |
| ر | 535 | |
| ا | 535 | |
| ة | 340 | |
| ي | 340 | |
| ب | 340 | |
| ع | 340 | |
| ل | 340 | |
| س | 140 | 4.2% |
| ف | 140 | 4.2% |
| ی | 140 | 4.2% |
| Other values (5) | 142 | 4.3% |
Han
| Value | Count | Frequency (%) |
| 語 | 1755 | |
| 本 | 1755 | |
| 日 | 1755 | |
| 话 | 1263 | |
| 州 | 946 | |
| 通 | 790 | |
| 普 | 790 | |
| 广 | 473 | 4.5% |
| 廣 | 473 | 4.5% |
| 話 | 473 | 4.5% |
Hebrew
| Value | Count | Frequency (%) |
| ִ | 430 | |
| ְ | 215 | |
| ת | 215 | |
| י | 215 | |
| ר | 215 | |
| ב | 215 | |
| ע | 215 |
Greek
| Value | Count | Frequency (%) |
| λ | 424 | |
| ά | 212 | |
| κ | 212 | |
| ν | 212 | |
| ι | 212 | |
| η | 212 | |
| ε | 212 |
Georgian
| Value | Count | Frequency (%) |
| ქ | 33 | |
| ი | 33 | |
| რ | 33 | |
| თ | 33 | |
| ა | 33 | |
| ლ | 33 | |
| უ | 33 |
Devanagari
| Value | Count | Frequency (%) |
| ि | 706 | |
| न | 706 | |
| ी | 706 | |
| द | 706 | |
| ् | 706 | |
| ह | 706 |
Hangul
| Value | Count | Frequency (%) |
| 어 | 542 | |
| 선 | 542 | |
| 말 | 542 | |
| 국 | 542 | |
| 한 | 542 | |
| 조 | 542 |
Thai
| Value | Count | Frequency (%) |
| า | 350 | |
| ภ | 175 | |
| ษ | 175 | |
| ไ | 175 | |
| ท | 175 | |
| ย | 175 |
Gurmukhi
| Value | Count | Frequency (%) |
| ਪ | 18 | |
| ੰ | 18 | |
| ਜ | 18 | |
| ਾ | 18 | |
| ਬ | 18 | |
| ੀ | 18 |
Telugu
| Value | Count | Frequency (%) |
| ు | 136 | |
| ె | 68 | |
| త | 68 | |
| గ | 68 | |
| ల | 68 |
Tamil
| Value | Count | Frequency (%) |
| ் | 111 | |
| ம | 111 | |
| த | 111 | |
| ி | 111 | |
| ழ | 111 |
Bengali
| Value | Count | Frequency (%) |
| া | 94 | |
| ং | 47 | |
| ল | 47 | |
| ব | 47 |
Common
| Value | Count | Frequency (%) |
| , | 11607 | |
| / | 1015 | 8.0% |
| ? | 50 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 329202 | |
| CJK | 10473 | 2.8% |
| Cyrillic | 10381 | 2.8% |
| None | 10379 | 2.8% |
| Devanagari | 4236 | 1.1% |
| Arabic | 3332 | 0.9% |
| Hangul | 3252 | 0.9% |
| Hebrew | 1720 | 0.5% |
| Thai | 1225 | 0.3% |
| Tamil | 555 | 0.1% |
| Other values (6) | 1108 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 42209 | |
| n | 37415 | |
| i | 36983 | |
| l | 34590 | |
| h | 31428 | |
| E | 31167 | |
| g | 30383 | |
| a | 18889 | 5.7% |
| , | 11607 | 3.5% |
| o | 7038 | 2.1% |
| Other values (37) | 47493 |
None
| Value | Count | Frequency (%) |
| ç | 4433 | |
| ñ | 2410 | |
| ê | 590 | 5.7% |
| λ | 424 | 4.1% |
| Č | 281 | 2.7% |
| ý | 281 | 2.7% |
| ü | 246 | 2.4% |
| ά | 212 | 2.0% |
| κ | 212 | 2.0% |
| ν | 212 | 2.0% |
| Other values (10) | 1078 | 10.4% |
Cyrillic
| Value | Count | Frequency (%) |
| с | 3190 | |
| к | 1722 | |
| и | 1667 | |
| й | 1605 | |
| у | 1554 | |
| а | 112 | 1.1% |
| р | 86 | 0.8% |
| ь | 53 | 0.5% |
| ї | 53 | 0.5% |
| н | 53 | 0.5% |
| Other values (12) | 286 | 2.8% |
CJK
| Value | Count | Frequency (%) |
| 語 | 1755 | |
| 本 | 1755 | |
| 日 | 1755 | |
| 话 | 1263 | |
| 州 | 946 | |
| 通 | 790 | |
| 普 | 790 | |
| 广 | 473 | 4.5% |
| 廣 | 473 | 4.5% |
| 話 | 473 | 4.5% |
Devanagari
| Value | Count | Frequency (%) |
| ि | 706 | |
| न | 706 | |
| ी | 706 | |
| द | 706 | |
| ् | 706 | |
| ह | 706 |
Hangul
| Value | Count | Frequency (%) |
| 어 | 542 | |
| 선 | 542 | |
| 말 | 542 | |
| 국 | 542 | |
| 한 | 542 | |
| 조 | 542 |
Arabic
| Value | Count | Frequency (%) |
| ر | 535 | |
| ا | 535 | |
| ة | 340 | |
| ي | 340 | |
| ب | 340 | |
| ع | 340 | |
| ل | 340 | |
| س | 140 | 4.2% |
| ف | 140 | 4.2% |
| ی | 140 | 4.2% |
| Other values (5) | 142 | 4.3% |
Hebrew
| Value | Count | Frequency (%) |
| ִ | 430 | |
| ְ | 215 | |
| ת | 215 | |
| י | 215 | |
| ר | 215 | |
| ב | 215 | |
| ע | 215 |
Thai
| Value | Count | Frequency (%) |
| า | 350 | |
| ภ | 175 | |
| ษ | 175 | |
| ไ | 175 | |
| ท | 175 | |
| ย | 175 |
Telugu
| Value | Count | Frequency (%) |
| ు | 136 | |
| ె | 68 | |
| త | 68 | |
| గ | 68 | |
| ల | 68 |
Tamil
| Value | Count | Frequency (%) |
| ் | 111 | |
| ம | 111 | |
| த | 111 | |
| ி | 111 | |
| ழ | 111 |
Bengali
| Value | Count | Frequency (%) |
| া | 94 | |
| ং | 47 | |
| ল | 47 | |
| ব | 47 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ế | 61 | |
| ệ | 61 |
Georgian
| Value | Count | Frequency (%) |
| ქ | 33 | |
| ი | 33 | |
| რ | 33 | |
| თ | 33 | |
| ა | 33 | |
| ლ | 33 | |
| უ | 33 |
Gurmukhi
| Value | Count | Frequency (%) |
| ਪ | 18 | |
| ੰ | 18 | |
| ਜ | 18 | |
| ਾ | 18 | |
| ਬ | 18 | |
| ੀ | 18 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 4 |
release_year
Real number (ℝ)
| Distinct | 135 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1991.8828 |
| Minimum | 1874 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 1874 |
|---|---|
| 5-th percentile | 1941 |
| Q1 | 1978 |
| median | 2001 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2020 |
| Range | 146 |
| Interquartile range (IQR) | 32 |
Descriptive statistics
| Standard deviation | 24.05304 |
|---|---|
| Coefficient of variation (CV) | 0.01207553 |
| Kurtosis | 0.84037057 |
| Mean | 1991.8828 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -1.2247867 |
| Sum | 90323919 |
| Variance | 578.54874 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 1973 | 4.4% |
| 2015 | 1904 | 4.2% |
| 2013 | 1887 | 4.2% |
| 2012 | 1721 | 3.8% |
| 2011 | 1666 | 3.7% |
| 2016 | 1604 | 3.5% |
| 2009 | 1585 | 3.5% |
| 2010 | 1501 | 3.3% |
| 2008 | 1470 | 3.2% |
| 2007 | 1319 | 2.9% |
| Other values (125) | 28716 |
| Value | Count | Frequency (%) |
| 1874 | 1 | < 0.1% |
| 1878 | 1 | < 0.1% |
| 1883 | 1 | < 0.1% |
| 1887 | 1 | < 0.1% |
| 1888 | 2 | < 0.1% |
| 1890 | 5 | < 0.1% |
| 1891 | 6 | |
| 1892 | 3 | < 0.1% |
| 1893 | 1 | < 0.1% |
| 1894 | 13 |
| Value | Count | Frequency (%) |
| 2020 | 1 | < 0.1% |
| 2018 | 5 | < 0.1% |
| 2017 | 532 | 1.2% |
| 2016 | 1604 | |
| 2015 | 1904 | |
| 2014 | 1973 | |
| 2013 | 1887 | |
| 2012 | 1721 | |
| 2011 | 1666 | |
| 2010 | 1501 |
return
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 1256 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 660.47917 |
| Minimum | 0 |
|---|---|
| Maximum | 12396383 |
| Zeros | 40033 |
| Zeros (%) | 88.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 354.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2.5375 |
| Maximum | 12396383 |
| Range | 12396383 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 74717.996 |
|---|---|
| Coefficient of variation (CV) | 113.12695 |
| Kurtosis | 20659.288 |
| Mean | 660.47917 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 138.28379 |
| Sum | 29950088 |
| Variance | 5.582779 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40033 | |
| 0.01 | 64 | 0.1% |
| 0.02 | 38 | 0.1% |
| 1 | 34 | 0.1% |
| 0.08 | 29 | 0.1% |
| 0.06 | 27 | 0.1% |
| 0.62 | 25 | 0.1% |
| 0.03 | 24 | 0.1% |
| 1.1 | 23 | 0.1% |
| 1.2 | 23 | 0.1% |
| Other values (1246) | 5026 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 40033 | |
| 0.01 | 64 | 0.1% |
| 0.02 | 38 | 0.1% |
| 0.03 | 24 | 0.1% |
| 0.04 | 19 | < 0.1% |
| 0.05 | 22 | < 0.1% |
| 0.06 | 27 | 0.1% |
| 0.07 | 18 | < 0.1% |
| 0.08 | 29 | 0.1% |
| 0.09 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 12396383 | 1 | |
| 8500000 | 1 | |
| 4197476.62 | 1 | |
| 2755584 | 1 | |
| 1018619.28 | 1 | |
| 1000000 | 1 | |
| 26881.72 | 1 | |
| 12890.39 | 1 | |
| 5330.34 | 1 | |
| 4133.33 | 1 |
companies_id
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 22290 |
|---|---|
| Distinct (%) | 67.4% |
| Missing | 12264 |
| Missing (%) | 27.0% |
| Memory size | 354.4 KiB |
| 8411 | 742 |
|---|---|
| 6194 | 540 |
| 4 | 504 |
| 306 | 439 |
| 33 | 320 |
| Other values (22285) |
Length
| Max length | 146 |
|---|---|
| Median length | 124 |
| Mean length | 9.5989964 |
| Min length | 1 |
Characters and Unicode
| Total characters | 317554 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 19970 ? |
|---|---|
| Unique (%) | 60.4% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 559,2550,10201 |
| 3rd row | 6194,19464 |
| 4th row | 306 |
| 5th row | 5842,9195 |
Common Values
| Value | Count | Frequency (%) |
| 8411 | 742 | 1.6% |
| 6194 | 540 | 1.2% |
| 4 | 504 | 1.1% |
| 306 | 439 | 1.0% |
| 33 | 320 | 0.7% |
| 6 | 247 | 0.5% |
| 441 | 207 | 0.5% |
| 5 | 146 | 0.3% |
| 5120 | 145 | 0.3% |
| 2 | 85 | 0.2% |
| Other values (22280) | 29707 | |
| (Missing) | 12264 |
Length
| Value | Count | Frequency (%) |
| 8411 | 742 | 2.2% |
| 6194 | 540 | 1.6% |
| 4 | 504 | 1.5% |
| 306 | 439 | 1.3% |
| 33 | 320 | 1.0% |
| 6 | 247 | 0.7% |
| 441 | 207 | 0.6% |
| 5 | 146 | 0.4% |
| 5120 | 145 | 0.4% |
| 2 | 85 | 0.3% |
| Other values (22280) | 29707 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 42863 | |
| , | 35364 | |
| 2 | 31502 | |
| 3 | 30318 | |
| 4 | 29421 | |
| 6 | 27019 | |
| 5 | 26750 | |
| 8 | 24769 | |
| 7 | 23568 | |
| 9 | 23406 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 282190 | |
| Other Punctuation | 35364 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 42863 | |
| 2 | 31502 | |
| 3 | 30318 | |
| 4 | 29421 | |
| 6 | 27019 | |
| 5 | 26750 | |
| 8 | 24769 | |
| 7 | 23568 | |
| 9 | 23406 | |
| 0 | 22574 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 35364 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 317554 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 42863 | |
| , | 35364 | |
| 2 | 31502 | |
| 3 | 30318 | |
| 4 | 29421 | |
| 6 | 27019 | |
| 5 | 26750 | |
| 8 | 24769 | |
| 7 | 23568 | |
| 9 | 23406 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 317554 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 42863 | |
| , | 35364 | |
| 2 | 31502 | |
| 3 | 30318 | |
| 4 | 29421 | |
| 6 | 27019 | |
| 5 | 26750 | |
| 8 | 24769 | |
| 7 | 23568 | |
| 9 | 23406 |
companies_name
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 22240 |
|---|---|
| Distinct (%) | 67.2% |
| Missing | 12264 |
| Missing (%) | 27.0% |
| Memory size | 354.4 KiB |
| Metro-Goldwyn-Mayer(MGM) | 742 |
|---|---|
| WarnerBros. | 540 |
| ParamountPictures | 504 |
| TwentiethCenturyFoxFilmCorporation | 439 |
| UniversalPictures | 320 |
| Other values (22235) |
Length
| Max length | 531 |
|---|---|
| Median length | 313 |
| Mean length | 36.536394 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1208697 |
|---|---|
| Distinct characters | 288 |
| Distinct categories | 15 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 19909 ? |
|---|---|
| Unique (%) | 60.2% |
Sample
| 1st row | PixarAnimationStudios |
|---|---|
| 2nd row | TriStarPictures,TeitlerFilm,InterscopeCommunications |
| 3rd row | WarnerBros.,LancasterGate |
| 4th row | TwentiethCenturyFoxFilmCorporation |
| 5th row | SandollarProductions,TouchstonePictures |
Common Values
| Value | Count | Frequency (%) |
| Metro-Goldwyn-Mayer(MGM) | 742 | 1.6% |
| WarnerBros. | 540 | 1.2% |
| ParamountPictures | 504 | 1.1% |
| TwentiethCenturyFoxFilmCorporation | 439 | 1.0% |
| UniversalPictures | 320 | 0.7% |
| RKORadioPictures | 247 | 0.5% |
| ColumbiaPicturesCorporation | 207 | 0.5% |
| ColumbiaPictures | 146 | 0.3% |
| Mosfilm | 145 | 0.3% |
| WaltDisneyPictures | 85 | 0.2% |
| Other values (22230) | 29707 | |
| (Missing) | 12264 |
Length
| Value | Count | Frequency (%) |
| metro-goldwyn-mayer(mgm | 742 | 2.2% |
| warnerbros | 540 | 1.6% |
| paramountpictures | 504 | 1.5% |
| twentiethcenturyfoxfilmcorporation | 439 | 1.3% |
| universalpictures | 320 | 1.0% |
| rkoradiopictures | 247 | 0.7% |
| columbiapicturescorporation | 207 | 0.6% |
| columbiapictures | 146 | 0.4% |
| mosfilm | 145 | 0.4% |
| waltdisneypictures | 85 | 0.3% |
| Other values (22189) | 29707 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 103753 | 8.6% |
| e | 91275 | 7.6% |
| n | 87195 | 7.2% |
| o | 82733 | 6.8% |
| r | 81320 | 6.7% |
| t | 81276 | 6.7% |
| a | 74646 | 6.2% |
| s | 60744 | 5.0% |
| l | 49324 | 4.1% |
| m | 42956 | 3.6% |
| Other values (278) | 453475 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 956127 | |
| Uppercase Letter | 192473 | 15.9% |
| Other Punctuation | 42741 | 3.5% |
| Decimal Number | 4154 | 0.3% |
| Dash Punctuation | 4149 | 0.3% |
| Open Punctuation | 4140 | 0.3% |
| Close Punctuation | 4139 | 0.3% |
| Math Symbol | 594 | < 0.1% |
| Other Letter | 140 | < 0.1% |
| Other Symbol | 25 | < 0.1% |
| Other values (5) | 15 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 103753 | |
| e | 91275 | |
| n | 87195 | |
| o | 82733 | |
| r | 81320 | |
| t | 81276 | |
| a | 74646 | 7.8% |
| s | 60744 | 6.4% |
| l | 49324 | 5.2% |
| m | 42956 | 4.5% |
| Other values (102) | 200905 |
Other Letter
| Value | Count | Frequency (%) |
| 스 | 9 | 6.4% |
| 트 | 8 | 5.7% |
| 인 | 6 | 4.3% |
| 주 | 5 | 3.6% |
| 먼 | 5 | 3.6% |
| 테 | 5 | 3.6% |
| 터 | 5 | 3.6% |
| 엔 | 5 | 3.6% |
| 픽 | 4 | 2.9% |
| 디 | 3 | 2.1% |
| Other values (62) | 85 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 27288 | |
| F | 25447 | |
| C | 19698 | 10.2% |
| M | 12962 | 6.7% |
| S | 11571 | 6.0% |
| E | 9465 | 4.9% |
| A | 9161 | 4.8% |
| T | 9055 | 4.7% |
| B | 8751 | 4.5% |
| G | 7590 | 3.9% |
| Other values (52) | 51485 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 35757 | |
| . | 5536 | 13.0% |
| & | 744 | 1.7% |
| / | 625 | 1.5% |
| ! | 36 | 0.1% |
| % | 17 | < 0.1% |
| : | 9 | < 0.1% |
| @ | 5 | < 0.1% |
| ; | 3 | < 0.1% |
| # | 3 | < 0.1% |
| Other values (4) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 993 | |
| 1 | 676 | |
| 0 | 629 | |
| 3 | 519 | |
| 4 | 461 | |
| 9 | 198 | 4.8% |
| 6 | 192 | 4.6% |
| 7 | 170 | 4.1% |
| 8 | 159 | 3.8% |
| 5 | 157 | 3.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4130 | |
| [ | 9 | 0.2% |
| ( | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4129 | |
| ] | 9 | 0.2% |
| ) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4147 | |
| – | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 593 | |
| | | 1 | 0.2% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 23 | |
| ㈜ | 2 | 8.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 | |
| » | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 3 |
Other Number
| Value | Count | Frequency (%) |
| ² | 1 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1148197 | |
| Common | 59955 | 5.0% |
| Cyrillic | 373 | < 0.1% |
| Hangul | 115 | < 0.1% |
| Greek | 31 | < 0.1% |
| Han | 26 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 103753 | 9.0% |
| e | 91275 | 7.9% |
| n | 87195 | 7.6% |
| o | 82733 | 7.2% |
| r | 81320 | 7.1% |
| t | 81276 | 7.1% |
| a | 74646 | 6.5% |
| s | 60744 | 5.3% |
| l | 49324 | 4.3% |
| m | 42956 | 3.7% |
| Other values (99) | 392975 |
Hangul
| Value | Count | Frequency (%) |
| 스 | 9 | 7.8% |
| 트 | 8 | 7.0% |
| 인 | 6 | 5.2% |
| 주 | 5 | 4.3% |
| 먼 | 5 | 4.3% |
| 테 | 5 | 4.3% |
| 터 | 5 | 4.3% |
| 엔 | 5 | 4.3% |
| 픽 | 4 | 3.5% |
| 디 | 3 | 2.6% |
| Other values (43) | 60 |
Cyrillic
| Value | Count | Frequency (%) |
| и | 34 | 9.1% |
| о | 28 | 7.5% |
| а | 26 | 7.0% |
| л | 22 | 5.9% |
| н | 20 | 5.4% |
| м | 19 | 5.1% |
| т | 17 | 4.6% |
| ь | 16 | 4.3% |
| с | 16 | 4.3% |
| е | 16 | 4.3% |
| Other values (36) | 159 |
Common
| Value | Count | Frequency (%) |
| , | 35757 | |
| . | 5536 | 9.2% |
| - | 4147 | 6.9% |
| ( | 4130 | 6.9% |
| ) | 4129 | 6.9% |
| 2 | 993 | 1.7% |
| & | 744 | 1.2% |
| 1 | 676 | 1.1% |
| 0 | 629 | 1.0% |
| / | 625 | 1.0% |
| Other values (31) | 2589 | 4.3% |
Greek
| Value | Count | Frequency (%) |
| ο | 3 | 9.7% |
| ν | 3 | 9.7% |
| ρ | 2 | 6.5% |
| τ | 2 | 6.5% |
| Κ | 2 | 6.5% |
| ι | 2 | 6.5% |
| η | 2 | 6.5% |
| λ | 2 | 6.5% |
| Ε | 2 | 6.5% |
| ό | 1 | 3.2% |
| Other values (10) | 10 |
Han
| Value | Count | Frequency (%) |
| 影 | 2 | 7.7% |
| 北 | 2 | 7.7% |
| 京 | 2 | 7.7% |
| 有 | 2 | 7.7% |
| 限 | 2 | 7.7% |
| 公 | 2 | 7.7% |
| 司 | 2 | 7.7% |
| 乐 | 1 | 3.8% |
| 电 | 1 | 3.8% |
| 安 | 1 | 3.8% |
| Other values (9) | 9 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1203076 | |
| None | 5102 | 0.4% |
| Cyrillic | 373 | < 0.1% |
| Hangul | 113 | < 0.1% |
| CJK | 26 | < 0.1% |
| Punctuation | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 103753 | 8.6% |
| e | 91275 | 7.6% |
| n | 87195 | 7.2% |
| o | 82733 | 6.9% |
| r | 81320 | 6.8% |
| t | 81276 | 6.8% |
| a | 74646 | 6.2% |
| s | 60744 | 5.0% |
| l | 49324 | 4.1% |
| m | 42956 | 3.6% |
| Other values (73) | 447854 |
None
| Value | Count | Frequency (%) |
| é | 2747 | |
| ó | 377 | 7.4% |
| á | 301 | 5.9% |
| í | 166 | 3.3% |
| ñ | 146 | 2.9% |
| ü | 143 | 2.8% |
| ä | 133 | 2.6% |
| ö | 127 | 2.5% |
| ô | 127 | 2.5% |
| ç | 118 | 2.3% |
| Other values (74) | 717 | 14.1% |
Cyrillic
| Value | Count | Frequency (%) |
| и | 34 | 9.1% |
| о | 28 | 7.5% |
| а | 26 | 7.0% |
| л | 22 | 5.9% |
| н | 20 | 5.4% |
| м | 19 | 5.1% |
| т | 17 | 4.6% |
| ь | 16 | 4.3% |
| с | 16 | 4.3% |
| е | 16 | 4.3% |
| Other values (36) | 159 |
Hangul
| Value | Count | Frequency (%) |
| 스 | 9 | 8.0% |
| 트 | 8 | 7.1% |
| 인 | 6 | 5.3% |
| 주 | 5 | 4.4% |
| 먼 | 5 | 4.4% |
| 테 | 5 | 4.4% |
| 터 | 5 | 4.4% |
| 엔 | 5 | 4.4% |
| 픽 | 4 | 3.5% |
| 디 | 3 | 2.7% |
| Other values (42) | 58 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 | |
| – | 2 | |
| • | 1 | 14.3% |
| | 1 | 14.3% |
CJK
| Value | Count | Frequency (%) |
| 影 | 2 | 7.7% |
| 北 | 2 | 7.7% |
| 京 | 2 | 7.7% |
| 有 | 2 | 7.7% |
| 限 | 2 | 7.7% |
| 公 | 2 | 7.7% |
| 司 | 2 | 7.7% |
| 乐 | 1 | 3.8% |
| 电 | 1 | 3.8% |
| 安 | 1 | 3.8% |
| Other values (9) | 9 |
countries_iso
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 2383 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 6213 |
| Missing (%) | 13.7% |
| Memory size | 354.4 KiB |
| US | |
|---|---|
| GB | |
| FR | 1652 |
| JP | 1354 |
| IT | 1029 |
| Other values (2378) |
Length
| Max length | 74 |
|---|---|
| Median length | 2 |
| Mean length | 2.782332 |
| Min length | 2 |
Characters and Unicode
| Total characters | 108881 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1764 ? |
|---|---|
| Unique (%) | 4.5% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
Common Values
| Value | Count | Frequency (%) |
| US | 17836 | |
| GB | 2235 | 4.9% |
| FR | 1652 | 3.6% |
| JP | 1354 | 3.0% |
| IT | 1029 | 2.3% |
| CA | 840 | 1.9% |
| DE | 748 | 1.6% |
| IN | 735 | 1.6% |
| RU | 734 | 1.6% |
| GB,US | 569 | 1.3% |
| Other values (2373) | 11401 | |
| (Missing) | 6213 | 13.7% |
Length
| Value | Count | Frequency (%) |
| us | 17836 | |
| gb | 2235 | 5.7% |
| fr | 1652 | 4.2% |
| jp | 1354 | 3.5% |
| it | 1029 | 2.6% |
| ca | 840 | 2.1% |
| de | 748 | 1.9% |
| in | 735 | 1.9% |
| ru | 734 | 1.9% |
| gb,us | 569 | 1.5% |
| Other values (2373) | 11401 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 23026 | |
| U | 23009 | |
| , | 10205 | |
| R | 6674 | 6.1% |
| B | 4977 | 4.6% |
| E | 4743 | 4.4% |
| G | 4445 | 4.1% |
| F | 4329 | 4.0% |
| I | 3999 | 3.7% |
| A | 3130 | 2.9% |
| Other values (17) | 20344 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 98676 | |
| Other Punctuation | 10205 | 9.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 23026 | |
| U | 23009 | |
| R | 6674 | 6.8% |
| B | 4977 | 5.0% |
| E | 4743 | 4.8% |
| G | 4445 | 4.5% |
| F | 4329 | 4.4% |
| I | 3999 | 4.1% |
| A | 3130 | 3.2% |
| T | 3000 | 3.0% |
| Other values (16) | 17344 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10205 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 98676 | |
| Common | 10205 | 9.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 23026 | |
| U | 23009 | |
| R | 6674 | 6.8% |
| B | 4977 | 5.0% |
| E | 4743 | 4.8% |
| G | 4445 | 4.5% |
| F | 4329 | 4.4% |
| I | 3999 | 4.1% |
| A | 3130 | 3.2% |
| T | 3000 | 3.0% |
| Other values (16) | 17344 |
Common
| Value | Count | Frequency (%) |
| , | 10205 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 108881 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 23026 | |
| U | 23009 | |
| , | 10205 | |
| R | 6674 | 6.1% |
| B | 4977 | 4.6% |
| E | 4743 | 4.4% |
| G | 4445 | 4.1% |
| F | 4329 | 4.0% |
| I | 3999 | 3.7% |
| A | 3130 | 2.9% |
| Other values (17) | 20344 |
countries_name
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 2383 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 6213 |
| Missing (%) | 13.7% |
| Memory size | 354.4 KiB |
| UnitedStatesofAmerica | |
|---|---|
| UnitedKingdom | |
| France | 1652 |
| Japan | 1354 |
| Italy | 1029 |
| Other values (2378) |
Length
| Max length | 211 |
|---|---|
| Median length | 148 |
| Mean length | 17.008433 |
| Min length | 4 |
Characters and Unicode
| Total characters | 665591 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1764 ? |
|---|---|
| Unique (%) | 4.5% |
Sample
| 1st row | UnitedStatesofAmerica |
|---|---|
| 2nd row | UnitedStatesofAmerica |
| 3rd row | UnitedStatesofAmerica |
| 4th row | UnitedStatesofAmerica |
| 5th row | UnitedStatesofAmerica |
Common Values
| Value | Count | Frequency (%) |
| UnitedStatesofAmerica | 17836 | |
| UnitedKingdom | 2235 | 4.9% |
| France | 1652 | 3.6% |
| Japan | 1354 | 3.0% |
| Italy | 1029 | 2.3% |
| Canada | 840 | 1.9% |
| Germany | 748 | 1.6% |
| India | 735 | 1.6% |
| Russia | 734 | 1.6% |
| UnitedKingdom,UnitedStatesofAmerica | 569 | 1.3% |
| Other values (2373) | 11401 | |
| (Missing) | 6213 | 13.7% |
Length
| Value | Count | Frequency (%) |
| unitedstatesofamerica | 17836 | |
| unitedkingdom | 2235 | 5.7% |
| france | 1652 | 4.2% |
| japan | 1354 | 3.5% |
| italy | 1029 | 2.6% |
| canada | 840 | 2.1% |
| germany | 748 | 1.9% |
| india | 735 | 1.9% |
| russia | 734 | 1.9% |
| unitedkingdom,unitedstatesofamerica | 569 | 1.5% |
| Other values (2373) | 11401 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 80562 | |
| t | 72563 | 10.9% |
| a | 70400 | 10.6% |
| i | 58494 | 8.8% |
| n | 47439 | 7.1% |
| d | 34515 | 5.2% |
| r | 32443 | 4.9% |
| o | 29543 | 4.4% |
| m | 28675 | 4.3% |
| c | 26338 | 4.0% |
| Other values (41) | 184619 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 557931 | |
| Uppercase Letter | 97455 | 14.6% |
| Other Punctuation | 10205 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 80562 | |
| t | 72563 | |
| a | 70400 | |
| i | 58494 | |
| n | 47439 | |
| d | 34515 | |
| r | 32443 | |
| o | 29543 | 5.3% |
| m | 28675 | 5.1% |
| c | 26338 | 4.7% |
| Other values (16) | 76959 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 25351 | |
| S | 23818 | |
| A | 22375 | |
| K | 5214 | 5.4% |
| F | 4321 | 4.4% |
| I | 3576 | 3.7% |
| C | 2591 | 2.7% |
| G | 2467 | 2.5% |
| J | 1661 | 1.7% |
| R | 1304 | 1.3% |
| Other values (14) | 4777 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10205 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 655386 | |
| Common | 10205 | 1.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 80562 | |
| t | 72563 | |
| a | 70400 | |
| i | 58494 | 8.9% |
| n | 47439 | 7.2% |
| d | 34515 | 5.3% |
| r | 32443 | 5.0% |
| o | 29543 | 4.5% |
| m | 28675 | 4.4% |
| c | 26338 | 4.0% |
| Other values (40) | 174414 |
Common
| Value | Count | Frequency (%) |
| , | 10205 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 665591 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 80562 | |
| t | 72563 | 10.9% |
| a | 70400 | 10.6% |
| i | 58494 | 8.8% |
| n | 47439 | 7.1% |
| d | 34515 | 5.2% |
| r | 32443 | 4.9% |
| o | 29543 | 4.4% |
| m | 28675 | 4.3% |
| c | 26338 | 4.0% |
| Other values (41) | 184619 |
release_date
Categorical
| Distinct | 17333 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.4 KiB |
| 2008-01-01 | 136 |
|---|---|
| 2009-01-01 | 121 |
| 2007-01-01 | 117 |
| 2005-01-01 | 111 |
| 2006-01-01 | 101 |
| Other values (17328) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 453460 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8579 ? |
|---|---|
| Unique (%) | 18.9% |
Sample
| 1st row | 1995-10-30 |
|---|---|
| 2nd row | 1995-12-15 |
| 3rd row | 1995-12-22 |
| 4th row | 1995-12-22 |
| 5th row | 1995-02-10 |
Common Values
| Value | Count | Frequency (%) |
| 2008-01-01 | 136 | 0.3% |
| 2009-01-01 | 121 | 0.3% |
| 2007-01-01 | 117 | 0.3% |
| 2005-01-01 | 111 | 0.2% |
| 2006-01-01 | 101 | 0.2% |
| 2002-01-01 | 96 | 0.2% |
| 2004-01-01 | 90 | 0.2% |
| 2001-01-01 | 84 | 0.2% |
| 2003-01-01 | 76 | 0.2% |
| 1997-01-01 | 69 | 0.2% |
| Other values (17323) | 44345 |
Length
| Value | Count | Frequency (%) |
| 2008-01-01 | 136 | 0.3% |
| 2009-01-01 | 121 | 0.3% |
| 2007-01-01 | 117 | 0.3% |
| 2005-01-01 | 111 | 0.2% |
| 2006-01-01 | 101 | 0.2% |
| 2002-01-01 | 96 | 0.2% |
| 2004-01-01 | 90 | 0.2% |
| 2001-01-01 | 84 | 0.2% |
| 2003-01-01 | 76 | 0.2% |
| 1997-01-01 | 69 | 0.2% |
| Other values (17323) | 44345 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 97532 | |
| - | 90692 | |
| 1 | 84002 | |
| 2 | 52761 | |
| 9 | 39752 | |
| 3 | 15418 | 3.4% |
| 8 | 15269 | 3.4% |
| 6 | 15010 | 3.3% |
| 5 | 14828 | 3.3% |
| 7 | 14282 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 362768 | |
| Dash Punctuation | 90692 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 97532 | |
| 1 | 84002 | |
| 2 | 52761 | |
| 9 | 39752 | |
| 3 | 15418 | 4.3% |
| 8 | 15269 | 4.2% |
| 6 | 15010 | 4.1% |
| 5 | 14828 | 4.1% |
| 7 | 14282 | 3.9% |
| 4 | 13914 | 3.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 90692 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 453460 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 97532 | |
| - | 90692 | |
| 1 | 84002 | |
| 2 | 52761 | |
| 9 | 39752 | |
| 3 | 15418 | 3.4% |
| 8 | 15269 | 3.4% |
| 6 | 15010 | 3.3% |
| 5 | 14828 | 3.3% |
| 7 | 14282 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 453460 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 97532 | |
| - | 90692 | |
| 1 | 84002 | |
| 2 | 52761 | |
| 9 | 39752 | |
| 3 | 15418 | 3.4% |
| 8 | 15269 | 3.4% |
| 6 | 15010 | 3.3% |
| 5 | 14828 | 3.3% |
| 7 | 14282 | 3.1% |
month_time
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.4 KiB |
| enero | |
|---|---|
| septiembre | |
| octubre | |
| diciembre | |
| noviembre | |
| Other values (7) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 6.5277202 |
| Min length | 4 |
Characters and Unicode
| Total characters | 296006 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | octubre |
|---|---|
| 2nd row | diciembre |
| 3rd row | diciembre |
| 4th row | diciembre |
| 5th row | febrero |
Common Values
| Value | Count | Frequency (%) |
| enero | 5909 | |
| septiembre | 4834 | |
| octubre | 4613 | |
| diciembre | 3781 | |
| noviembre | 3661 | |
| marzo | 3549 | |
| abril | 3452 | |
| agosto | 3393 | |
| mayo | 3337 | |
| junio | 3151 | |
| Other values (2) | 5666 |
Length
| Value | Count | Frequency (%) |
| enero | 5909 | |
| septiembre | 4834 | |
| octubre | 4613 | |
| diciembre | 3781 | |
| noviembre | 3661 | |
| marzo | 3549 | |
| abril | 3452 | |
| agosto | 3393 | |
| mayo | 3337 | |
| junio | 3151 | |
| Other values (2) | 5666 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 51873 | |
| o | 36672 | |
| r | 35855 | |
| i | 25298 | |
| b | 23369 | |
| m | 19162 | 6.5% |
| a | 13731 | 4.6% |
| t | 12840 | 4.3% |
| n | 12721 | 4.3% |
| u | 10402 | 3.5% |
| Other values (11) | 54083 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 296006 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 51873 | |
| o | 36672 | |
| r | 35855 | |
| i | 25298 | |
| b | 23369 | |
| m | 19162 | 6.5% |
| a | 13731 | 4.6% |
| t | 12840 | 4.3% |
| n | 12721 | 4.3% |
| u | 10402 | 3.5% |
| Other values (11) | 54083 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 296006 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 51873 | |
| o | 36672 | |
| r | 35855 | |
| i | 25298 | |
| b | 23369 | |
| m | 19162 | 6.5% |
| a | 13731 | 4.6% |
| t | 12840 | 4.3% |
| n | 12721 | 4.3% |
| u | 10402 | 3.5% |
| Other values (11) | 54083 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 296006 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 51873 | |
| o | 36672 | |
| r | 35855 | |
| i | 25298 | |
| b | 23369 | |
| m | 19162 | 6.5% |
| a | 13731 | 4.6% |
| t | 12840 | 4.3% |
| n | 12721 | 4.3% |
| u | 10402 | 3.5% |
| Other values (11) | 54083 |
day_time
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 354.4 KiB |
| viernes | |
|---|---|
| jueves | |
| miercoles | |
| sabado | |
| martes | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.7738941 |
| Min length | 5 |
Characters and Unicode
| Total characters | 307169 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | lunes |
|---|---|
| 2nd row | viernes |
| 3rd row | viernes |
| 4th row | viernes |
| 5th row | viernes |
Common Values
| Value | Count | Frequency (%) |
| viernes | 13902 | |
| jueves | 7520 | |
| miercoles | 7027 | |
| sabado | 5149 | 11.4% |
| martes | 4644 | 10.2% |
| domingo | 3607 | 8.0% |
| lunes | 3497 | 7.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| viernes | 13902 | |
| jueves | 7520 | |
| miercoles | 7027 | |
| sabado | 5149 | 11.4% |
| martes | 4644 | 10.2% |
| domingo | 3607 | 8.0% |
| lunes | 3497 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 65039 | |
| s | 41739 | |
| r | 25573 | 8.3% |
| i | 24536 | 8.0% |
| v | 21422 | 7.0% |
| n | 21006 | 6.8% |
| o | 19390 | 6.3% |
| m | 15278 | 5.0% |
| a | 14942 | 4.9% |
| u | 11017 | 3.6% |
| Other values (7) | 47227 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 307169 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 65039 | |
| s | 41739 | |
| r | 25573 | 8.3% |
| i | 24536 | 8.0% |
| v | 21422 | 7.0% |
| n | 21006 | 6.8% |
| o | 19390 | 6.3% |
| m | 15278 | 5.0% |
| a | 14942 | 4.9% |
| u | 11017 | 3.6% |
| Other values (7) | 47227 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 307169 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 65039 | |
| s | 41739 | |
| r | 25573 | 8.3% |
| i | 24536 | 8.0% |
| v | 21422 | 7.0% |
| n | 21006 | 6.8% |
| o | 19390 | 6.3% |
| m | 15278 | 5.0% |
| a | 14942 | 4.9% |
| u | 11017 | 3.6% |
| Other values (7) | 47227 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 307169 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 65039 | |
| s | 41739 | |
| r | 25573 | 8.3% |
| i | 24536 | 8.0% |
| v | 21422 | 7.0% |
| n | 21006 | 6.8% |
| o | 19390 | 6.3% |
| m | 15278 | 5.0% |
| a | 14942 | 4.9% |
| u | 11017 | 3.6% |
| Other values (7) | 47227 |
| id | popularity | vote_average | vote_count | runtime | budget | revenue | id_btc | release_year | return | status | original_language | month_time | day_time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| id | 1.000 | -0.410 | -0.149 | -0.433 | -0.214 | -0.255 | -0.278 | 0.445 | 0.392 | -0.263 | 0.056 | 0.071 | 0.038 | 0.040 |
| popularity | -0.410 | 1.000 | 0.241 | 0.894 | 0.315 | 0.463 | 0.491 | -0.309 | 0.186 | 0.446 | 0.000 | 0.000 | 0.006 | 0.004 |
| vote_average | -0.149 | 0.241 | 1.000 | 0.317 | 0.196 | 0.072 | 0.127 | -0.027 | -0.009 | 0.121 | 0.019 | 0.070 | 0.026 | 0.044 |
| vote_count | -0.433 | 0.894 | 0.317 | 1.000 | 0.298 | 0.484 | 0.513 | -0.330 | 0.197 | 0.473 | 0.000 | 0.000 | 0.021 | 0.029 |
| runtime | -0.214 | 0.315 | 0.196 | 0.298 | 1.000 | 0.229 | 0.255 | -0.170 | 0.032 | 0.235 | 0.000 | 0.111 | 0.026 | 0.028 |
| budget | -0.255 | 0.463 | 0.072 | 0.484 | 0.229 | 1.000 | 0.644 | -0.290 | 0.141 | 0.771 | 0.000 | 0.000 | 0.035 | 0.040 |
| revenue | -0.278 | 0.491 | 0.127 | 0.513 | 0.255 | 0.644 | 1.000 | -0.320 | 0.103 | 0.849 | 0.000 | 0.000 | 0.029 | 0.025 |
| id_btc | 0.445 | -0.309 | -0.027 | -0.330 | -0.170 | -0.290 | -0.320 | 1.000 | 0.094 | -0.307 | 0.000 | 0.170 | 0.046 | 0.072 |
| release_year | 0.392 | 0.186 | -0.009 | 0.197 | 0.032 | 0.141 | 0.103 | 0.094 | 1.000 | 0.085 | 0.028 | 0.144 | 0.044 | 0.082 |
| return | -0.263 | 0.446 | 0.121 | 0.473 | 0.235 | 0.771 | 0.849 | -0.307 | 0.085 | 1.000 | 0.000 | 0.000 | 0.006 | 0.000 |
| status | 0.056 | 0.000 | 0.019 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.028 | 0.000 | 1.000 | 0.000 | 0.005 | 0.004 |
| original_language | 0.071 | 0.000 | 0.070 | 0.000 | 0.111 | 0.000 | 0.000 | 0.170 | 0.144 | 0.000 | 0.000 | 1.000 | 0.045 | 0.171 |
| month_time | 0.038 | 0.006 | 0.026 | 0.021 | 0.026 | 0.035 | 0.029 | 0.046 | 0.044 | 0.006 | 0.005 | 0.045 | 1.000 | 0.048 |
| day_time | 0.040 | 0.004 | 0.044 | 0.029 | 0.028 | 0.040 | 0.025 | 0.072 | 0.082 | 0.000 | 0.004 | 0.171 | 0.048 | 1.000 |
| id | title | overview | popularity | vote_average | vote_count | status | original_language | runtime | budget | revenue | tagline | id_btc | name_btc | poster_btc | backdrop_btc | iso_639_1 | language_name | release_year | return | companies_id | companies_name | countries_iso | countries_name | release_date | month_time | day_time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 862 | Toy Story | LedbyWoody,Andy'stoyslivehappilyinhisroomuntilAndy'sbirthdaybringsBuzzLightyearontothescene.AfraidoflosinghisplaceinAndy'sheart,WoodyplotsagainstBuzz.ButwhencircumstancesseparateBuzzandWoodyfromtheirowner,theduoeventuallylearnstoputasidetheirdifferences. | 21.946943 | 7.7 | 5415.0 | Released | en | 81 | 30000000.0 | 373554033.0 | NaN | 10194.0 | ToyStoryCollection | /7G9915LfUQ2lVfwMEEhDsn3kT4B.jpg | /9FBwqcd9IRruEDUrTdcaafOMKUq.jpg | en | English | 1995 | 12.45 | 3 | PixarAnimationStudios | US | UnitedStatesofAmerica | 1995-10-30 | octubre | lunes |
| 1 | 8844 | Jumanji | WhensiblingsJudyandPeterdiscoveranenchantedboardgamethatopensthedoortoamagicalworld,theyunwittinglyinviteAlan--anadultwho'sbeentrappedinsidethegamefor26years--intotheirlivingroom.Alan'sonlyhopeforfreedomistofinishthegame,whichprovesriskyasallthreefindthemselvesrunningfromgiantrhinoceroses,evilmonkeysandotherterrifyingcreatures. | 17.015539 | 6.9 | 2413.0 | Released | en | 104 | 65000000.0 | 262797249.0 | Rollthediceandunleashtheexcitement! | NaN | NaN | NaN | NaN | en,fr | English,Français | 1995 | 4.04 | 559,2550,10201 | TriStarPictures,TeitlerFilm,InterscopeCommunications | US | UnitedStatesofAmerica | 1995-12-15 | diciembre | viernes |
| 2 | 15602 | Grumpier Old Men | Afamilyweddingreignitestheancientfeudbetweennext-doorneighborsandfishingbuddiesJohnandMax.Meanwhile,asultryItaliandivorcéeopensarestaurantatthelocalbaitshop,alarmingthelocalswhoworryshe'llscarethefishaway.Butshe'slessinterestedinseafoodthansheisincookingupahottimewithMax. | 11.712900 | 6.5 | 92.0 | Released | en | 101 | 0.0 | 0.0 | StillYelling.StillFighting.StillReadyforLove. | 119050.0 | GrumpyOldMenCollection | /nLvUdqgPgm3F85NMCii9gVFUcet.jpg | /hypTnLot2z8wpFS7qwsQHW1uV8u.jpg | en | English | 1995 | 0.00 | 6194,19464 | WarnerBros.,LancasterGate | US | UnitedStatesofAmerica | 1995-12-22 | diciembre | viernes |
| 3 | 31357 | Waiting to Exhale | Cheatedon,mistreatedandsteppedon,thewomenareholdingtheirbreath,waitingfortheelusive"goodman"tobreakastringofless-than-stellarlovers.FriendsandconfidantsVannah,Bernie,GloandRobintalkitallout,determinedtofindabetterwaytobreathe. | 3.859495 | 6.1 | 34.0 | Released | en | 127 | 16000000.0 | 81452156.0 | Friendsarethepeoplewholetyoubeyourself...andneverletyouforgetit. | NaN | NaN | NaN | NaN | en | English | 1995 | 5.09 | 306 | TwentiethCenturyFoxFilmCorporation | US | UnitedStatesofAmerica | 1995-12-22 | diciembre | viernes |
| 4 | 11862 | Father of the Bride Part II | JustwhenGeorgeBankshasrecoveredfromhisdaughter'swedding,hereceivesthenewsthatshe'spregnant...andthatGeorge'swife,Nina,isexpectingtoo.Hewasplanningonsellingtheirhome,butthat'saplanthat--likeGeorge--willhavetochangewiththearrivalofbothagrandchildandakidofhisown. | 8.387519 | 5.7 | 173.0 | Released | en | 106 | 0.0 | 76578911.0 | JustWhenHisWorldIsBackToNormal...He'sInForTheSurpriseOfHisLife! | 96871.0 | FatheroftheBrideCollection | /nts4iOmNnq7GNicycMJ9pSAn204.jpg | /7qwE57OVZmMJChBpLEbJEmzUydk.jpg | en | English | 1995 | 0.00 | 5842,9195 | SandollarProductions,TouchstonePictures | US | UnitedStatesofAmerica | 1995-02-10 | febrero | viernes |
| 5 | 949 | Heat | Obsessivemasterthief,NeilMcCauleyleadsatop-notchcrewonvariousinsaneheiststhroughoutLosAngeleswhileamentallyunstabledetective,VincentHannapursueshimwithoutrest.Eachmanrecognizesandrespectstheabilityandthededicationoftheothereventhoughtheyareawaretheircat-and-mousegamemayendinviolence. | 17.924927 | 7.7 | 1886.0 | Released | en | 170 | 60000000.0 | 187436818.0 | ALosAngelesCrimeSaga | NaN | NaN | NaN | NaN | en,es | English,Español | 1995 | 3.12 | 508,675,6194 | RegencyEnterprises,ForwardPass,WarnerBros. | US | UnitedStatesofAmerica | 1995-12-15 | diciembre | viernes |
| 6 | 11860 | Sabrina | Anuglyducklinghavingundergonearemarkablechange,stillharborsfeelingsforhercrush:acarefreeplayboy,butnotbeforehisbusiness-focusedbrotherhassomethingtosayaboutit. | 6.677277 | 6.2 | 141.0 | Released | en | 127 | 58000000.0 | 0.0 | Youarecordiallyinvitedtothemostsurprisingmergeroftheyear. | NaN | NaN | NaN | NaN | fr,en | Français,English | 1995 | 0.00 | 4,258,932,5842,14941,55873,58079 | ParamountPictures,ScottRudinProductions,MirageEnterprises,SandollarProductions,ConstellationEntertainment,Worldwide,MontBlancEntertainmentGmbH | DE,US | Germany,UnitedStatesofAmerica | 1995-12-15 | diciembre | viernes |
| 7 | 45325 | Tom and Huck | Amischievousyoungboy,TomSawyer,witnessesamurderbythedeadlyInjunJoe.TombecomesfriendswithHuckleberryFinn,aboywithnofutureandnofamily.Tomhastochoosebetweenhonoringafriendshiporhonoringanoathbecausethetownalcoholicisaccusedofthemurder.TomandHuckgothroughseveraladventurestryingtoretrieveevidence. | 2.561161 | 5.4 | 45.0 | Released | en | 97 | 0.0 | 0.0 | TheOriginalBadBoys. | NaN | NaN | NaN | NaN | en,de | English,Deutsch | 1995 | 0.00 | 2 | WaltDisneyPictures | US | UnitedStatesofAmerica | 1995-12-22 | diciembre | viernes |
| 8 | 9091 | Sudden Death | InternationalactionsuperstarJeanClaudeVanDammeteamswithPowersBootheinaTension-packed,suspensethriller,setagainsttheback-dropofaStanleyCupgame.VanDammeportraysafatherwhosedaughterissuddenlytakenduringachampionshiphockeygame.Withthecaptorsdemandingabilliondollarsbygame'send,VanDammefranticallysetsaplaninmotiontorescuehisdaughterandabortanimpendingexplosionbeforethefinalbuzzer... | 5.231580 | 5.5 | 174.0 | Released | en | 106 | 35000000.0 | 64350171.0 | Terrorgoesintoovertime. | NaN | NaN | NaN | NaN | en | English | 1995 | 1.84 | 33,21437,23770 | UniversalPictures,ImperialEntertainment,SignatureEntertainment | US | UnitedStatesofAmerica | 1995-12-22 | diciembre | viernes |
| 9 | 710 | GoldenEye | JamesBondmustunmaskthemysteriousheadoftheJanusSyndicateandpreventtheleaderfromutilizingtheGoldenEyeweaponssystemtoinflictdevastatingrevengeonBritain. | 14.686036 | 6.6 | 1194.0 | Released | en | 130 | 58000000.0 | 352194034.0 | Nolimits.Nofears.Nosubstitutes. | 645.0 | JamesBondCollection | /HORpg5CSkmeQlAolx3bKMrKgfi.jpg | /6VcVl48kNKvdXOZfJPdarlUGOsk.jpg | en,ru,es | English,Pусский,Español | 1995 | 6.07 | 60,7576 | UnitedArtists,EonProductions | GB,US | UnitedKingdom,UnitedStatesofAmerica | 1995-11-16 | noviembre | jueves |
| id | title | overview | popularity | vote_average | vote_count | status | original_language | runtime | budget | revenue | tagline | id_btc | name_btc | poster_btc | backdrop_btc | iso_639_1 | language_name | release_year | return | companies_id | companies_name | countries_iso | countries_name | release_date | month_time | day_time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45336 | 67179 | St. Michael Had a Rooster | Sentencedtolifeimprisonmentforillegalactivities,ItalianInternationalmemberGiulioManieriholdsontohispoliticalidealswhilestrugglingagainstmadnessinthelonelinessofhisprisoncell. | 0.225051 | 6.0 | 3.0 | Released | it | 90 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | it | Italiano | 1972 | 0.0 | NaN | NaN | NaN | NaN | 1972-01-01 | enero | sabado |
| 45337 | 84419 | House of Horrors | Anunsuccessfulsculptorsavesamadmannamed"TheCreeper"fromdrowning.Seeinganopportunityforrevenge,hetricksthepsychointomurderinghiscritics. | 0.222814 | 6.3 | 8.0 | Released | en | 65 | 0.0 | 0.0 | Meet...TheCREEPER! | NaN | NaN | NaN | NaN | en | English | 1946 | 0.0 | 33 | UniversalPictures | US | UnitedStatesofAmerica | 1946-03-29 | marzo | viernes |
| 45338 | 390959 | Shadow of the Blair Witch | Inthistrue-crimedocumentary,wedelveintothemurderspreethatwastheinspirationforJoeBerlinger's"BookofShadows:BlairWitch2". | 0.076061 | 7.0 | 2.0 | Released | en | 45 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | en | English | 2000 | 0.0 | NaN | NaN | NaN | NaN | 2000-10-22 | octubre | domingo |
| 45339 | 289923 | The Burkittsville 7 | AfilmarchivistrevisitsthestoryofRustinParr,ahermitthoughttohavemurderedsevenchildrenwhileunderthepossessionoftheBlairWitch. | 0.386450 | 7.0 | 1.0 | Released | en | 30 | 0.0 | 0.0 | Doyouknowwhathappened50yearsbefore"TheBlairWitchProject"? | NaN | NaN | NaN | NaN | en | English | 2000 | 0.0 | 27570,27571 | NeptuneSaladEntertainment,PirieProductions | US | UnitedStatesofAmerica | 2000-10-03 | octubre | martes |
| 45340 | 222848 | Caged Heat 3000 | It'stheyear3000AD.Theworld'smostdangerouswomenarebanishedtoaremoteasteroid45millionlightyearsfromearth.KiraMurphydoesn'tbelong;wrongfullyaccusedofacrimeshedidnotcommit,she'sthrowninthisinterplanetaryprisonandlefttoherowndefenses.ButKira'safighter,andsoonshefindsherselfinthemiddleofafemalegangwar;whereeveryonewantsapieceoftheaction...andapieceofher!"CagedHeat3000"takestheWomen-in-Prisongenretoawholenewlevel...andawholenewgalaxy! | 0.661558 | 3.5 | 1.0 | Released | en | 85 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | en | English | 1995 | 0.0 | 4688 | Concorde-NewHorizons | US | UnitedStatesofAmerica | 1995-01-01 | enero | domingo |
| 45341 | 30840 | Robin Hood | Yetanotherversionoftheclassicepic,withenoughvariationtomakeitinteresting.Thestoryisthesame,butsomeofthecharactersarequitedifferentfromtheusual,inparticularUmaThurman'sveryspecialmaidMarian.Thephotographyisalsogreat,givingthestoryasomewhatdarkertone. | 5.683753 | 5.7 | 26.0 | Released | en | 104 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | en | English | 1991 | 0.0 | 7025,10163,16323,38978 | WestdeutscherRundfunk(WDR),WorkingTitleFilms,20thCenturyFoxTelevision,CanWestGlobalCommunications | CA,DE,GB,US | Canada,Germany,UnitedKingdom,UnitedStatesofAmerica | 1991-05-13 | mayo | lunes |
| 45342 | 111109 | Century of Birthing | Anartiststrugglestofinishhisworkwhileastorylineaboutacultplaysinhishead. | 0.178241 | 9.0 | 3.0 | Released | tl | 360 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | tl | NaN | 2011 | 0.0 | 19653 | SineOlivia | PH | Philippines | 2011-11-17 | noviembre | jueves |
| 45343 | 67758 | Betrayal | Whenoneofherhitsgoeswrong,aprofessionalassassinendsupwithasuitcasefullofamilliondollarsbelongingtoamobboss... | 0.903007 | 3.8 | 6.0 | Released | en | 90 | 0.0 | 0.0 | Adeadlygameofwits. | NaN | NaN | NaN | NaN | en | English | 2003 | 0.0 | 6165 | AmericanWorldPictures | US | UnitedStatesofAmerica | 2003-08-01 | agosto | viernes |
| 45344 | 227506 | Satan Triumphant | Inasmalltownlivetwobrothers,oneaministerandtheotheroneahunchbackpainterofthechapelwholiveswithhiswife.Onedreadfulandstormynight,astrangerknocksatthedooraskingforshelter.Thestrangertalksaboutallthegoodthingsoftheearthlylifetheministerismissingbecauseofhispuritanicalfaith.Theministercomestoacceptthestranger'sviewpointbutitisotherswhowillpaytheconsequencesbecausetheministerwilldiscoverthehumanpleasuresthanksto,ehem,hissister-in-law…Thetormentedministerandhiscuckoldedbrotherwilldieinastrangeaccidentinthechapelandlateraninfantwillbebornfromtheminister'sadulterousrelationship. | 0.003503 | 0.0 | 0.0 | Released | en | 87 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1917 | 0.0 | 88753 | Yermoliev | RU | Russia | 1917-10-21 | octubre | domingo |
| 45345 | 461257 | Queerama | 50yearsafterdecriminalisationofhomosexualityintheUK,directorDaisyAsquithminesthejewelsoftheBFIarchivetotakeusintotherelationships,desires,fearsandexpressionsofgaymenandwomeninthe20thcentury. | 0.163015 | 0.0 | 0.0 | Released | en | 75 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | en | English | 2017 | 0.0 | NaN | NaN | GB | UnitedKingdom | 2017-06-09 | junio | viernes |